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Introduction 


0.1 Background 

First-order logic is weak, it is incapable of making inductive definitions. 
There are extensions of first-order logic in which inductive definitions are 
possible, namely, fixed-point extensions. For example, first-order logic 
cannot express the path relation, which is the transitive closure of the 
edge relation, in graphs. Actually it is unable to express transitive closure 
in general as noted by Aho and Ullman in 1979 (see pQ and [5], in fact 
they noted this for relational calculus, which, from the point of view 
of expressive power, is exactly first-order logic). They then, Aho and 
Ullman, suggested extending the relational calculus by adding the least 
fixed-point operator. 

There are two fields where fixed-point extensions of first-order logic 
were extensively studied earlier. One is the theory of inductive defini¬ 
tions (for example by Moschovakis [T3], 1974). The other is semantics of 
programming languages where a fixed-point extension of first-order logic 
is known as first-order /i-calculus. But neither of the two fields put finite 
structures into the center of attention. 

An inductive definition needs a certain number of iterations before 
it closes; this number is called the depth. Immerman gave an elaborate 
definition of depth for positive inductive definitions over finite structures 
(see m), then he proved that inductive depth equals parallel-time and 
equals the depth of circuit in Circuit Complexity. 

He also introduced another view of depth as the number of iterations 
of a quantifier block. He proved that the complexity class P is exactly 
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the set of boolean queries expressible by first-order quantifier blocks it¬ 
erated polynomially and PSPACE is exactly the set of boolean queries 
expressible by first-order quantifier blocks iterated exponentially. 

The theorems proved by Immerman showed the importance of depth 
as a complexity measure. Here we study depth. 


0.2 Summary 

The thesis consists of three chapters. In the first chapter we introduce the 
preliminary definitions and facts we need from logic and complexity. The 
last section is on complexity, and the first section is for extra notations 
that are not established within the definitions of the thesis. The middle 
three sections are about finite structures, first-order logic and its fixed- 
point extensions, the Ehrenfeucht-Frai'sse game and its importance in 
proving non-expressibility results. 

Particularly, at the end of the third section, we mention an exam¬ 
ple from [2] using the algebraic version of the game in proving non- 
expressibility of connectivity in first-order logic, and then, in the fourth 
section, we present fixed-point extensions of first-order logic in which con¬ 
nectivity is expressible, or in fact, in which the path relation, which is the 
transitive closure of the edge relation, in graphs, and transitive closure in 
general, and more complicated kinds of recursion, are expressible. 

The second section is devoted to finite structures, especially, graphs 
and binary strings. We deal here with finite structures only because the 
objects computers have and hold are always finite. Inputs, databases, 
programs are all finite objects that can be conveniently modeled as fi¬ 
nite logical structures. Binary strings are important because every finite 
ordered structure can be coded as a binary string, and this is how the 
structure is introduced as an input to the Turing machine. 

Graphs are important because every finite structure may be thought 
of as a graph, or as Immerman expressed it: "Everything is a Graph" 
Dm. Later in the first chapter we will be able to define precisely what we 
mean by saying that every finite structure may be thought of as a graph. 
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Graphs are important also because the computation of a Turing ma¬ 
chine can be represented as a graph, called the configuration graph, in 
which vertices represent possible configurations (the sum of machine’s 
state and work-tape inscription and positions of heads) of the machine 
and edges represent the possibility of a transition, in one step, by the 
machine’s transition function from that configuration to the other. 

Those configuration graphs showed their importance in the proofs of 
completeness of particular graph queries for certain complexity classes. 
For example, in the proofs of the completeness of reachability for nonde- 
terministic logspace and the completeness of alternating reachability for 
deterministic polynomial time, one obtains from a structure (being tested 
by the machine for its satisfaction of a query from one of these complexity 
classes) the configuration graph of the machine’s computation on it and 
then the problem is proved to be reducible to or equivalent to testing 
that graph for its satisfaction of the respective graph query. We used 
configuration graphs in the proof of theorem 13.3.101 

Another importance of graphs is that one of the most famous models 
of computation and its measures of complexity are defined in terms of 
graphs and properties of graphs, namely, Boolean Circuits, and measuring 
complexity via their depths. 

The second chapter is devoted to the notion of depth of inductive 
definitions over finite structures. Immerman defined depth for positive 
formulas only. Here we define depth for all formulas which have a fixed- 
point and set it to be oo if the fixed-point does not exist. 

We also define the inflationary-depth of a formula as the depth of the 
inflationary formula obtained from it. These definitions are in the first 
section then, in the second section, we exhibit the basic theorems relating 
depth to complexity classes from Immerman’s "Descriptive Complexity" 

DEB- 

In the last section we investigate the relationship between the depth 
of a formula and its inflationary-depth and find that no one of them 
is always greater than the other, i.e., there are formulas for which the 
depth is greater than the inflationary-depth and formulas for which the 
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inflationary-depth is greater than the depth. 

We investigate also the relationship between the depth of the simul¬ 
taneous fixed-point of two formulas and the depth of their nested fixed- 
point and show that no one of them is always greater than the other if 
the relation variables through which the fixed-point is computed are not 
restricted to be positive. 

What motivated us to compare the depth of the nested fixed-point and 
the depth of the simultaneous fixed-point of the same two formulas is that 
in the proof of Lemma 8.2.6 of f2j they show that under some conditions 
on two formulas ip and -0, we have that their nested fixed-point equals 
their simultaneous fixed-point. We wanted to see which one of the two 
ways, nested or simultaneous, to reach that fixed-point, consumes more 
time or needs a bigger number of iterations. 

In the third chapter, in the first section, we present finite variable 
logics and their respective games (pebble games). In the second section 
we study the expressive power of number of variables and quantifier rank, 
motivated by the theorems, exhibited in Section 2.2 from Immerman’s 
"Descriptive Complexity", relating respectively the number of variables 
in an inductive definition and its depth (which turns out to be equal to 
the number of iterations of a quantifier block) to the number of processors 
in a parallel machine and its parallel-time. 

We prove that there is a property of words expressible with two vari¬ 
ables and quantifier rank 2 n + 2 but not expressible with quantifier rank 
n with any number of variables (Proposition 13.2.31) . 

In the third section we focus on inhnitary finite variable logic 
which contains fixed-point logic and we introduce a rough relationship 
between the depth of an inductive definition in structures of size n and the 
number of V-symbols in the first-order formula expressing that inductive 
definition in structures of size n. 

We introduce a new complexity measure FOy[f (ri), g(n)] which counts 
the number, f(n), of V-symbols, and the number, g(ri), of variables, in 
first-order formulas needed to express a given property. We prove that 
for f(n) > logn, NSPACE[/(n)] C FO v [/(n) + (g)\g;J, and that 
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for any f(n),g(n), FO y[f(n),g(n)] C DSPACE[/(n)^(n) logn]. 

In the last section we talk more about pebble games and depth. We 
ask two questions and suggest a conjecture concerning depth. 

0.3 List of Results 

(1) Examples showing that, in general, there is no order comparison 
between the depth of a formula and its inflationary depth, and no 
order comparison between the depth of the iteration it takes to come 
to the simultaneous fixed-point and the depth of the iteration it 
takes to come to the nested fixed-point of two formulas if the relation 
variables through which the induction is made are not restricted to 
be positive. These examples are exhibited in Section 2.3 which is 
devoted for them, and therefore, titled Examples. 

(2) Proposition 13.2.31 where we prove that there is a property of words 
expressible with two variables and quantifier rank 2 n + 2 but not 
expressible with quantifier rank n with any number of variables. 

(3) Theorem 13.3.101 where we prove that for f(n ) > logn, 

NSPACE^CFOvi/W+^y.M], 

(4) Theorem 13.3.111 where we prove that for any f{n),g(n), 

F0 y[f(n),g(n)} C DSPACE[/(n)#(n) logn]. 


5 










1 


Preliminaries 


This is a chapter with the preliminaries we need from logic and complexity. 
It is mainly from (2j and mu- 

1.1 Notation 

Most of our notation is from [2] and Pi- Almost all notations are estab¬ 
lished within the definitions in the thesis, except a few things which we 
mention here. 

We write := to mean equals by definition. 

N denotes the natural numbers {0,1,...}. 

We write \ to denote the difference of sets, i.e., A\B := {x G A \ x ^ B}. 
For a finite set A, we write ||A|| to denote the number of elements in it. 
For a function /, we write do(f) and rg(f ) to denote its domain and 
range respectively. We write ( f) r (x ) to denote the result of applying / 

r times 

r times on x, i.e., /(/(... f(x) ...)). For two functions f,g from N to N, 
we write f(n) = 0(g(n)) to mean that there is a constant k such that 
f(ri) < k ■ g(n ) for every sufficiently large n, and write f(n) = Q(g(n)) 
to mean that f[n) = 0(g(n )) and g{n) = 0(f(n)). 

For a binary relation ~, we write ^ to denote its complement. 

We write x to denote the tuple (xi,... ,x n ) when n is understood from 
the context, and when a and b are of the same length n, we write oi->6to 
denote the map from {ai,..., a n } to {b\, ..., b n } under which the image 
of a* is bi for every i £ {1,..., n}. 

k times 

For a symbol s we sometimes write s k to denote the string s s... s. 
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We sometimes write M(w ) 4- to mean that Turing Machine M accepts 
input w. 


1.2 Finite Structures 

Definition 1.2.1. (Structures) 

(1) Vocabularies are finite sets that consist of relation symbols P,Q,R,... 
and constant symbols (for short: constants ) c, d, .... Every relation 
symbol is equipped with a natural number > 1, its arity. We denote 
vocabularies by r, a,.. .. A vocabulary is relational , if it does not 
contain constants. 

(2) A structure A of vocabulary r (by short: a r-structure) consists of 
a nonempty set A, the universe or domain of A (also denoted by 
|M|), of an n-ary relation R A on A for every n-ary relation symbol 
R in r, and of an element c A of A for every constant c in r. 

(3) An n-ary relation S on A is a subset of A n , the set of n-tuples of 
elements of A. We mostly write Sa\... a n instead of (cq,..., a n ) G 

S. 

(4) A structure A is finite, if its universe A is a finite set. We denote 
the cardinality of A if it is finite by ||M||. 

Proviso 1.2.2. All structures in the thesis are assumed to be finite. 

Definition 1.2.3. (Orderings) 

(a) Let t = {<} with a binary relation symbol <. A r-structure 
A = (A, <' A ) is called an ordering if for all a, b, c G A : 

(1) not a < A a. 

(2) a < A b or b < A a or a = b. 

(3) if a < A b and b < A c then a < A c. 
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(b) Let S' be a binary relation symbol (representing the successor rela¬ 
tion), and min and max constants (for the first and last element of 
the ordering). A finite {<, S, min, max}-structure A is an ordering 
if, in addition to (1), (2), (3), for all a, b G A : 

(4) S A ab iff (a < A b and for all c, if a < A c then b < A c or b = c). 

(5) min A < A a or min A = a. 

(6) a < A max A or a = max A . 

(c) Suppose that To is a vocabulary with {<} C tq C {<, S, min, max} 
and let cr be an arbitrary vocabulary with To C a. A finite cr- 
structure A is said to be ordered, if the reduct A\tq (i.e., the re¬ 
structure obtained from *4.) by forgetting the interpretations of the 
symbols in cr\ro) is an ordering. 

Proviso 1.2.4. Unless stated otherwise, we identify the universe of struc¬ 
tures of size n with {0,..., n — 1}, and for orderings we assume that < 
is interpreted with the usual ordering inherited from N. 

Definition 1.2.5. (Isomorphism) Fixing a vocabulary r, two r-structures 
A and B are isomorphic, written A = B if there is an isomorphism from A 
to B, i.e., a bijection n : A —> B preserving relations and constants, that is, 
for any n-ary R G r and a\,..., a n G A, R A a\... a n iff R B 7r(a\)... 7r (a n ) 
and, for any constant c G r, 7r(c A ) = c b . 

Definition 1.2.6. (Partial Isomorphisms) 

Assume A and B are structures. Let p be a map with do(p) C A and 
rg(p) C B, where do(p) and rg(p) denote the domain and the range of p, 
respectively. Then p is said to be a partial isomorphism from A to B if 

- p is injective. 

- for every c G r : c A G do(p) and p(c A ) = c b . 

- for every n-ary R G r and all a 1; ..., a n G do(p), 

R A a\...a n iff R B p(a\).. .p{a n ). 


9 


We write Part (-4, B) for the set of partial isomorphisms from A to B. 
Definition 1.2.7. (Disjoint Union) 

For relational r, the disjoint union of r-structures A and B with AnB = 0 
is the structure AJB with universe AU B and 

j^Aub ._ f>A u jj>n f or any R in r. 

In case A and B are structures with AnB ^ 0, we take isomorphic copies 
A! of A and B' of B with disjoint universes (e.g., with universes A x {1} 
and B x {2}) and set AJB := A'ClB'. 

Now we go to a particular kind of structures called graphs. Visu¬ 
ally speaking, a graph is a collection of points or vertices linked by line 
segments or edges. 

Definition 1.2.8. (Graphs) 

Let r = {E} where If is a binary relation symbol. A graph (or, undirected 
graph) is a r-structure Q = [G, E s ) satisfying 

(1) for all a E G : not E^aa. 

(2) for all o,ft6G: if E^ab then E^ba. 

By GRAPH we denote the class of graphs. If only (1) is required, we speak 
of a digraph (or, directed graph). The elements of G are sometimes called 
points or vertices, the elements of E s edges. 

Definition 1.2.9. 

(1) Let Q be a digraph. If n > 1 and 

E s a (i a\, E^a\a2, ■ ■ ■ , E^a n _\a n 

then ao,,a n is a path from ao to a n of length n (in undirected 
graphs this is also a path from a n to ao). 

(2) If a 0 = a n then a 0 ,... ,a n is a cycle. Q is acyclic if it has no cycle. 
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(3) Let Q be a graph. Write a ~ b if a = b or if there is a path from a 
to b. Clearly, ~ is an equivalence relation. The equivalence class of 
a is called the ( connected ) component of a. Q is connected if a ~ b 
for all a,b G G, that is, if there is only one connected component. 
Let CONN be the class of connected graphs. 

(4) Denote by d(a, b) the shortest length of a path from a to 6; more 
precisely, define the distance function d : G x G -)■ N U {oo} by 

d(a,b) = oo iff a ^ b ; d(a,b) — 0 iff a — b ; 
and otherwise, 

d(a, b) = min{n > 1 | there is a path from a to b of length n}. 

Having seen graphs let us now define strings and see how every ordered 
structure can be coded as a binary string (which makes it possible for a 
structure to be introduced as an input for a Turing machine). 

Definition 1.2.10. A string (also called a word ) of length n on k > 1 
symbols is an ordered structure of size n, over the vocabulary 
r = {< , R \,..., Rk}-, in which the unary relation Ri, 1 < i < k, is 
interpreted with the set of elements that represent positions in which 
the Ath symbol occurs. (Of course a string on k symbols can also be 
represented as an ordered structure on a vocabulary with just k — 1 unary 
relation symbols). 

For example the binary string w = 011001 is represented by the follow¬ 
ing ordered structure : W = (W, < w , R w ), where W = {0,1, 2, 3,4, 5}, 
and R w = {1,2,5} (the positions in which 1 occurs). 

Definition 1.2.11. (Binary Encoding of Structures) 

Let r = {<, Ri ,..., R r , ci,..., c s } where each Ri is of arity a*, and let 
A = ({0,1,..., n — 1}, < A , Rf ,..., Rf, cj 4 ,..., cf) be an ordered struc¬ 
ture of vocabulary r. The relation Rf is a subset of |-4.| a % and this 
contains exactly n ai tuples. We encode this relation as a binary string 
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bin^^Ri) of length n ai where “1” in a given position indicates that the 
corresponding tuple is in Rf. Similarly, for each constant cjh its num¬ 
ber is encoded as a binary string bin A (cj) of length [log rtj (this binary 
string is its normal representation in binary). The binary encoding of the 
structure A is then just the concatenation of these binary strings coding 
its relations and constants, 

bin(A) = bin A (Ri)bin' A '(R 2 ) ■ ■ ■ bin' A '{R r )bin' A '{c i)... bin A (c s ) 

We do not need any separators between the various relations and con¬ 
stants because the vocabulary r and the length of bin(A) determine where 
each section belongs. Observe that the length of bin(A) is given by 

||&m(*4)|| = n ai + ... + n ar + s |~logn] 

In the special case where r includes no relation symbols other than <, 
we pretend that there is a unary relation symbol that is always false. For 
example, if r = {<}, then bin(A) = O^L We do this to insure that the 
size of bin(A) is at least as large as ||.4||. 

Example 1.2.12. Consider the graph Q := (G, E s , s g ,f e ), where, G = 
{0,1, 2}, E s = {(0,1), (1, 2)}, s s = 0 and t Q = 2. Then its binary code is 
010001000010. 

1.3 First-Order Logic and Games 

Definition 1.3.1. (Syntax of First-Order Logic) Fix a vocabulary r. 
Each formula of first-order logic will be a string of symbols taken from 
the alphabet consisting of 

- Vi, V 2 , v 3 ,... (the variables ) 

- -i, V (the connectives not, or) 

- 3 (the existential quantifier) 
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- = (the equality symbol ) 

- ),( 

- the symbols in r. 

A term of vocabulary r is a variable or a constant in r. Henceforth, 
we often shall use the letters x,y,z,... for variables and t,ti,... for terms. 
The formulas of first-order logic of vocabulary r are those strings which 
are obtained by finitely many applications of the following rules: 

(FI) If to and t\ are terms then to — ti is a formula. 

(F2) If R in r is n-ary and t\,...,t n are terms then Rt±... Rt n is a 
formula. 

(F3) If p is a formula then —>p is a formula. 

(F4) If p and ip are formulas then (p V ip) is a formula. 

(F5) If p is a formula and x a variable then 3xp is a formula. 

Formulas obtained by (FI) or (F2) are called atomic formulas. For formu¬ 
las <p and ip we use (p A ip), (p —> ip), (p -B- ip), and \/xp as abbreviations 
for the formulas ~ i (->p V ~ 1 '0), ( _| ^V'0), {{->p\/ip) A(~np V </?)), and ->3x~<p, 
respectively. 

Denote by FO[r] the set of formulas of first-order logic of vocabulary r; 
and denote it by just FO when the vocabulary is understood from the 
context. 

Definition 1.3.2. (free variables) 

free(p) the set of free variables of a formula p : 

- if p is atomic then the set freefp) is the set of variables occurring 
in p. 

- free(->p) := free{p) 

- free(p V ip) := free(p) U free (ip) 
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- free(3xip) := free(<p)\{x}. 

We write tp(xi,..., x n ) to indicate that x±,... ,x n are distinct and 
free(<p) C {aq,... ,x n } without implying that all Xi are actually free in 
<p. A bound occurrence of a variable in a formula is an occurrence that 
lies in the scope of a corresponding quantifier. A sentence is a formula 
without free variables. 

Definition 1.3.3. (Semantics of First-Order Logic) 

Let A be a r-structure. An assignment in A is a function a. with domain 
the set of variables and with values in A, a : {v n \ n > 1} —» A. Extend a 
to a function defined for all terms by setting a(c) := e A for all constants 
in r. Denote by a^ the assignment that agrees with a on all variables 
except that a-(x) = a. 

We define the relation A t= <£>[a], called the satisfaction relation (“the as¬ 
signment a satisfies the formula ip in A" or u ip is true in A under a"), as 
follows: 


A b 

h = t 2 [a\ 

iff 

aitfi = 

a(t 2 ) 

A b 

Rt 1 ...t n [a] 

iff 

R A a(ti] 

). ,.(y.{tn) 

A b 


iff 

not A t= 

-- ip[a} 

A b 

((pV'fifia] 

iff 

A b tp[a\ or A b ^[a] 

A b 

3 xip[a\ 

iff 

there is 

an a G A such that A b (p[a^ 


Note that the truth or falsity of A b *p[oi\ depends only on the values of 
a for those variables x which are free in ip . That is, if afix) = a 2 (x) for 
all x G free(<p), then A b <^[aq] iff A 1= ^[aq]. Thus, if <p — <p(xi ,..., x n ) 
and cy = cc(xi),..., a n — a(x n ), then we may write A b ip[a\,.. ., a n ] for 
A b <p[a\. In particular, if p> is a sentence, then the truth or falsity of 
A b (/?[«] is completely independent of a. Thus we write A b ip (read : 
A is a model of ip, or A satisfies ip), if for some (hence every) assignment 
a, A b <p[a\. Fromulas ip and ^ are equivalent if p> yy fit is true in all 
structures under all assignments (we sometimes write ip = ijj to express 
this). 

It is assumed that first-order logic contains two zero-ary relation sym- 
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bols T,F. In every structure, T and F are interpreted as TRUE (i.e., as 
being true) and FALSE, respectively (TRUE corresponds to the zero-ary 
relation {0}, and FALSE to the zero-ary relation 0). Hence, the atomic 
formula T is equivalent to 3x(x = x) and F to —E\x(x = x). 

If $ = {pi, ..., p n } we sometimes write /\ $ for pi A... A p n and \J d* 
for ipi V ... V p n . In case <L = 0 we set /\ <L = T and V = F. Then, for 
arbitrary Unite $, 


A h <3> iff for all p e <1>, A N p. 

Definition 1.3.4. (Quantifier Rank) 

The quantifier rank qr(p ) of a formula p is the maximum number of 
nested quantifiers occurring in it : 

- qr{ip) := 0, if p is atomic 

- qr(~>p) : = qr(p) 

- qr(p\Z'ijj ) max{qr(p),qr(fi>)} 

- qr(3xp) := qr(p) + 1 

Definition 1.3.5. For structures A and B and m 6 N we write A = m B 
and say that A and B are m-equivalent, if A and B satisfy the same first- 
order sentences of quantifier rank < m. They are elementarily equivalent 
if they satisfy the same first-order sentences, in symbols, A = B. 

Definition 1.3.6. We say that a class K of structures is definable (or 
that its characteristic property is expressible) in a logic C if there is a 
sentence p G C such that for every structure A : A G K iff A t= p. 

Proviso 1.3.7. Throughout the text all classes K of structures considered 
will tacitly be assumed to be closed under isomorphisms, i.e. 

A <E K and A = B imply B G K. 

Now we define the Ehrenfeucht-Fraisse games. 
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Definition 1.3.8. (Games : Ehrenfeucht) 

Let A and B be structures, a £ A s , b £ B s , and m £ N. The Ehrenfeucht 
game G m (A,a,B,b) is played by two players called Spoiler and Duplica¬ 
tor. Each player has to make m moves in the course of a play. The players 
take turns. In the f-th move Spoiler first selects a structure, A or B , and 
an element in this structure. If Spoiler chooses e* in A then Duplicator 
in its i-th move must choose an element /, in B. If Spoiler chooses fi in 
B then Duplicator must choose an element e, £ A. 

At the end, elements ei,... ,e m in A and A,..., f m in B have been cho¬ 
sen. Duplicator wins iff a e t— b f £ Part (*4., B) (in case m = 0 we just 
require that a i —> b £ Part (*4., B)). Otherwise Spoiler wins. Equivalently, 
Spoiler wins if, after some i < m, He\ ... e* K > bf\ ■ ■ ■ fi is n °t a partial 
isomorphism. We say that a player, Spoiler or Duplicator, has a winning 
strategy in G m (A } a,B,b), or shortly, that he wins G m (A,a,B,b), if it is 
possible for it to win each play whatever choices are made by the oppo¬ 
nent. If s = 0 (and hence a and b are empty), we denote the game by 
G m (A, B). 

Example 1.3.9. Spoiler has a winning strategy in Gs(A,B), where, 

A := ({0,1}, <) and B := ({0,1,2}, <) and in both cases < denotes the 
natural ordering. Spoiler can begin by choosing 0 in B and Duplicator 
has to reply with 0 in A because if it replies with 1 then, no matter what 
element (other than 0) Spoiler chooses from B in the second move, Du¬ 
plicator will not find an element in A greater than 1 to reply with. In its 
second move Spoiler chooses 2 from B and Duplicator has to reply with 
the only element in A greater than 0, which is 1. In its third move Spoiler 
chooses 1 from B (which is between 0 and 2) but Duplicator cannot find 
an element in A between 0 and 1 (which correspond respectively to 0 and 
2 in B). 

Definition 1.3.10. (Games : Algebraic Version - Fraisse) 

Structures A and B are said to be m-isomorphic, written A = m B, if 
there is a sequence ( Ij)j< m with the following properties: 

(a) Every Ij is a nonempty set of partial isomorphisms from A to B. 
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(b) ( Forth property) For every j < m, p E Ij+i, and a E A there is 
q E Ij such that q D p and a E do(q). 

(c) ( Back property ) For every j < m, p E Ij+i, and b E B there is 
q E Ij such that q D p and b E rg(q). 

If ( Ij)j<m has properties (a), (b), and (c), we write ( Ij)j<m ■ A = m B and 
say that A and B are m-isomorphic via (Ij)j< m - 

Theorem 1.3.11. J3J/, ff/ 

For structures A and B, a E A s ,b E B s , and m > 0 the following are 
equivalent : 

(i) Duplicator wins G m (A , a, B, b). 

(ii) There is ( Ij)j<m with a \-E~b E I m such that ( Ij)j<m '■ A = m B. 

(Hi) a satisfies in A the same formulas of quantifier rank <masbEB. 

Corollary 1.3.12. 

For structures A, B and m > 0 the following are equivalent : 

(i) Duplicator wins G m (A,B). 

(ii) A = m B. 

(Hi) A = m B. 

The equivalence of (ii) and (iii) is known as Fraisse’s Theorem, and the 
equivalence of (i) and (ii) shows that Ehrenfeucht’s game and Fraisse’s 
game are different formulations of the same thing, therefore, one often 
speaks of the Ehrenfeucht-Frai'sse game. 

The following theorem with the previous corollary illustrate how games 
can be used in proving non-expressibility : 

Theorem 1.3.13. 

For a class K of structures the following are equivalent : 

(i) K is not definable in first-order logic. 
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(ii) For each m there are structures A and B such that: 


A G K,B ^ K and A = m B. 


Example 1.3.14. Jjfjf Let r = {<,min,max}. Suppose that A and B are 
ordered r-structures, ||«4|| > 2 m and \ \B\\ > 2 m . Then A = m B. Hence, 
the class of orderings of even cardinality is not definable in first-order 
logic. For a proof, given any ordering C, we define its distance function 
d by 


d(a , a!) := ||{6 G C \ (a < b < a') or (a' < b < a)}||. 


And, for j > 0, we introduce the "truncated" j-distance function dj 
on C x C by 



Now suppose that A and B are orderings with ||^4||, ||£?|| > 2 m . For 
j < m set 

Ij := {p 6 Part(A,B) j dj(a,a') = dj(p(a),p(a')) for a, a' 6 do(p)}. 

Then (Ij)j< m '■ A = m B : By assumption on the cardinalities of A and 
B we have {(min A , min B ), (max A , max 13 )} G Ij for every j < m. To give 
a proof of the forth property of ( Ij)j< m (the back property can be proven 
analogously), suppose j < m, p G Ij+i, and a G A. We distinguish two 
cases, depending on whether or not the following condition 


(*) there is ana' G do(p) such that dj (a, a') < 2 3 


is satisfied. 

If (*) holds then there is exactly one b G B for which p U {(a,b)} is a 
partial isomorphism preserving dj-distances. Now assume that (*) does 
not hold and let do(p ) = {ai,...,a r } with a\ < ... < a r . We restrict 
ourselves to the case a* < a < a i+ \ for some i. Then, 


18 


dj(di,a ) = oo and dj(a,ai + 1 ) = oo; hence dj + \(ai,ai + i) = oo and there¬ 
fore, dj + i(p(ai),p(a i+ i )) = oo. Thus there is a b such that p(af) < b < 
p(a i+ i), dj(p(ai),b) = oo, and dj{b,p(a i+ 1 )) = oo. One easily verifies that 
q := p U {(a, 5)} is a partial isomorphism in Ij. 

Example 1.3.15. For l > 1, let Qi be the graph given by a cycle of 
length l + 1. To be precise, set 

Gi := {0,...,/}, E Gl : = {(i,j + l) | i < Z}U{(i + l,i) | i < 1} U 
{(0, /),(/, 0)}. Thus, forl,k > 1, the disjoint union GiOQk consists of a 
cycle of length l + 1 and of a cycle of length k + 1. We show : 


If l > 2 m then Q x = m QiOQi. 

In fact, for j G N, we define the distance function dj on a graph Q by 



(where d denotes the distance function on Q as introduced in 
Definition \1.2.9l) . To show that Qj and QjJQi are m-isomorphic, one 
verifies ( Ij)j<m '■ Gi —m GiOGi where Ij is the set of p G Part(Gi,GiClGi) 
with ||do(p)|| < m— j and dj(a,b) = dj(p(a),p{b)) for a,b G do(p). (The 
proof is similar to that of example \1.3.14\ ). 

Then by Theorem \1.3.1tA the class CONN of connected graphs is not 
definable in first-order logic, since for each m we have 


£/2 m ^ CONN , (^2 m U^2 m ^ CONN , G2 m =m G2 rn OGz 


Consequently, the path relation, the transitive closure of the edge rela¬ 
tion on the class GRAPH, is not first-order definable. In fact, suppose 
ijj(x,y) is a first-order formula defining the path relation on GRAPH. 
Then CONN would be the class of models of \/xWy(->x — y —>■ if(x,y)) 
(and the graph axioms). 

Definition 1.3.16. (Second-Order Logic) 

Second-order logic , SO, is an extension of first-order logic which allows 
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to quantify over relations. In addition to the symbols of first-order logic, 
its alphabet contains, for each n > 1, countably many n-ary relation 
(or predicate) variables V™, VJ 1 , .... To denote relation variables we use 
letters X, Y,.... The set of second-order formulas of a vocabulary r is the 
set generated by the rules for first-order formulas extended by : 

- If X is n-ary and t\,... ,t n are terms then Xt\ ...t n is a formula. 

- If <p is a formula and X is a relation variable then 3 Xip is a formula. 

The free occurrence of a variable or of a relation variable in a second- 
order formula is defined in the obvious way and the notion of satisfaction 
is extended canonically. Then, given p = ..., x nj Y \,..., Y k ) with 

free (individual and relation) variables among x\,... ,x n ,Yi,... ,Y k , a r- 
structure A, elements cq,..., a n G A, and relations Ri,..., R k over A of 
arities corresponding to Yi,..., Y*,, respectively, 

A h ^[aq,..., a n , Ri ,..., Rk] 

means that ai,... ,a n together with Ri ,..., Rk satisfy p in A. 

1.4 Fixed-Point Logics 

Here in this section we introduce some of the well-known fixed-point ex¬ 
tensions of first-order logic. Let us first pave the road to the definitions: 

Fix a vocabulary r. Consider a second-order formula ip(R, Y, x\, ..., Xk, u), 
where R is a relation variable of arity k, and let T be an interpretation 
of Y and b an interpretation of u, 

• For any r-structure A, the function 

F vA . p(^) 

F*’ A (S) \= {a e A k \ A^ <p[S,T, ai ,..., a k , b]} 
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gives rise to a sequence of k -ary relations : 


0, F^(0), (F^) 2 (0), (F^) 3 (0),... 


Denote its members by F^’ 14 , Ff’’’ 4 , Ff’’ 4 ,..i.e., F 0 v,vH = 0 and F^ = 
F <p ’ A (F^’ A ). F% ,a is called the n-th stage of F^” 4 (or of p in A under the 
given interpretations of Y and u). Suppose that there is an n 0 e N such 
that that is, = F*f. Then, F% A = for 

all m > n 0 . We denote Fjfj- 4 by F^ A and say that the fixed-point FQ A 
of F^’- 4 exists (It may be also called the fixed-point of p in A "under the 
given interpretations of Y and u". But if the free variables of ip are among 
R and x±,... ,Xk only, i.e., there are no extra variables to be interpreted, 
we just call it the fixed-point of p in A). 

In case the fixed-point F^ A does not exist, we agree to set F^ A := 0. 
When we talk about the stages and fixed-point of p in general, regardless 
of the interpretations of the extra free variables and regardless of the 
structure in which they are interpreted, or when the interpretations and 
the structures are understood from context, we just say the "stages" and 
the "the fixed-point", and write F% and Fj£. 

(Note that the notation F V,A does not make explicit all relevant data.) 

• Clearly, for any structure A, the sequence for the formula Rx V p is 
increasing (in fact more, the function F ( ' R ^ Vlf > ,A is inflationary, i.e., has the 
property that S C F( mv<p > ,A (S) for any S C A k ), and since the structure 
A is finite, the sequence must come to the fixed-point in at most ||*4.|| 
stages. This fixed-point (of Rx\/ p in A "under the given interpretations 
of the extra free variables") is called the inflationary fixed-point of p in A 
"under the given interpretations of the extra free variables". (Sometimes 
we call Rx\/ p "the inflationary formula obtained from p") 

• p is said to be F-positive if, when expressed using only the connectives 
-i, V, all occurrences of R in p are positive, i.e., are in the scope of an even 
number of negations. It can be easily proved (by induction on formulas) 
that if p is F-positive then it is monotone in the sense that : For every 
structure A, and every S 1 ,S 2 £ A k , S 1 C =>• F lp ’ A (S 1 ) C F‘ P ’ A (S 2 )- 


<fi,A _ 
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Clearly, if ip is monotone then for every structure A, 


0 C C (F*' A ) 2 ($) C (F^’' 4 ) 3 (0) C ... 

and hence its fixed-point exists (the sequence comes to it at a stage < 
11*4.| | fc ). This fixed-point is also the least hxed-point : Let S C A k be such 
that F‘ P ’ A (S) — S. 0 C S i.e. Ftf ,A C S. Assuming that F?’ A C S, where 
i > 0, we have by the monotonicity of ip that F‘ P,A (F^’’ A ) C F ip ’ A (S) 
i.e. Ff+ A C S. Hence, by induction, Ff )A C S for every % > 0. Thus 
F^ A C S. This shows that F^ A is contained in every other hxed-point 
of F" P,A 1 hence it is the least hxed-point of F V,A . 

Definition 1.4.1. (Fixed-Point Extensions of First-Order Logic) 

(1) Partial Fixed-Point Logic FO(PFP) : 

The class of formulas of FO(PFP) of vocabulary r is given by the 
calculus 

• - where ip is an atomic second-order formula over r. 

<p ^ 

• J£_ JP_ 

-i(^ 5 (<pV^) 5 3xtp 

• fuwfw —it where the lengths of x and t are the same and coincide 
with the arity of R. 

(The expression means that if <pi,... ,(pi are formulas then 

ip is a formula). The semantics is defined inductively with respect 
to this calculus, the meaning of [PFP^^t being t G F^. In par¬ 
ticular, [PFP^^jt is false if the hxed-point of ip does not exist. 
Sentences are formulas without free hrst-order and second-order 
variables where the free occurrence of variables are defined in the 
standard way, adding the clause 

free([PFP R)W ip]t) := free(t ) U (f ree(ip)\{x, R}) 

(2) Inhationary Fixed-Point Logic FO(IFP) : 


22 





The class of formulas of FO(IFP) of vocabulary r is given by the 
calculus with the same rules as the calculus for FO(PFP) except for 
the last rule which is replaced by 

" where the lengths of x and t are the same and coincide 

with the arity of R." 

The semantics for [IFP# i$ <p]f being t E F^ v<p \ 

(3) Least Fixed-Point Logic FO(LFP) : 

The class of formulas of FO(LFP) of vocabulary r is given by the 
calculus with the same rules as the calculus for FO(PFP) but the 
last rule is restricted to -R-positive formulas (p. In this case, as we 
have shown above, is monotone and hence its fixed-point exists 
and it is also the least fixed-point. 

Thus, in this case, the formula [PFP^tpjt expresses that t is in 
the least fixed-point of <p. We therefore write [LFP# )S <p]t instead of 
[PFPi^jt 


We sometimes write [PFPr^], [IFP^. s (p], or [LFPr^] to denote the 
fixed-points. 

Remark 1.4.2. 

Clearly, FO(LFP) < FO(IFP) < FO(PFP), (i.e. every formula in FO(LFP) 
is equivalent to a formula in FO(IFP), and every formula in FO(IFP) is 
equivalent to a formula in FO(PFP)). Actually, on every class of finite 
structures, 

FO(LFP) = FO(IFP) (Gurevich and Shelah 1986 [3]) 

Remark 1.4.3. 

It can be easily proved that the extra free first-order variables in fixed- 
point formulas, for example as u in page 20, can always be avoided at the 
expense of relations of higher arity. Thus, we can normally assume that 
no extra free first-order variables are present in the fixed-point formula. 
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Remark 1.4.4. 

These logics are strong enough to express transitive closure. Given a 
global binary relation R over the set of fc-tuples (of any structure), its 
transitive closure in any structure is expressible by 

[LFP x ,x,ytp(X,x, y)]xy 

where <p(X, x,y) := Rxy V 3zi ... 3zk(Xxz A Rzy ). 


Let us now present simultaneous fixed-points: 

Given two formulas tp (X, Y,x i,... ,Xk) and fi>(X, Y,yi,... ,yi), where X 
is a relation variable of arity k and Y is a relation variable of arity l, and 
a structure A, the functions 

F 1 :A k x A 1 -A A k 


F 1 (S 1 ,S 2 ) := {aeA k \A\=<p[S 1 ,S 2 ,a 1 ,...,a k \} 


and 


F 2 :A k x A 1 A 1 

-12/0 o \_ (Ur- aI 


F (Si, S 2 ) := {b E A \ A ip[Si, S 2 , b\,. 
give rise to two sequences : 


■M} 


F[o) ■= ^+ 1 ) := F\Ftn), F( n) ) for i = 1, 2. 


If we have for some n that (F ( ‘ n) , F 2 n) ) = (F^ +1)J F ( 2 n+1) ), we set 
(^oo), Ffa) := (F( n y Ff n) ) and say that the simultaneous fixed-point (F^, F^) 
of (F 1 , F 2 ) exists (we may also call it the simultaneous fixed-point of ip 
and il> in *4.). 

Note that for i — 1, 2, 


F'(FL).F^ = F'^y 
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1.5 Complexity 


In this section we introduce the definitions and facts we need from com¬ 
plexity theorey. This section is mainly form Immerman’s "Descriptive 
Complexity" [TO]. We assume that the reader is familiar with the Turing 
machine. 

Definition 1.5.1. A query is any mapping / : STRUC[cr] —> STRUCfr] 
from the finite structures of one vocabulary to the finite structures of 
another vocabulary, that is polynomially bounded. That is, there is a 
polynomial p such that for all A G STRUC [cr], ||/(^4)|| < p(||^4||). A 
boolean query is a map A : STRUC[cr] —> {0,1}, from the finite structures 
of a vocabulary to {0,1}. A boolean query may also be thought of as a 
subset of STRUC [cr] - the set of finite structures A for which 1(A) = 1. 

Any sentence ip, from any logic over any vocabulary cr, defines a 
boolean query I v on STRUC[cr] where I V (A) — 1 iff A 1= (p. 

Definition 1.5.2. (First-Order Queries) 

Let cr and r be any two vocabularies where r = {i?i,..., R ri Ci,..., c s } 
and each Rj has arity aj, and let k be a fixed natural number. A k-ary 
first-order query is a map 

/ : STRUC [cr] STRUC [r] 

defined by an r+s+l-tuple of first-order formulas, </? 0 , <fii, • • •, p> r , An ■ ■ •, Va 
from FO [cr]. For each structure A G STRUC [cr], these formulas describe 
a structure 1(A) G STRUC [r], 

1(A) = (|/(.4)|, R [ w ,.... R'JA, c[ w , .... c'JA) 

The universe of 1(A) is a first-order definable subset of A k , 

\I(A)\ = {(b\... 1 b k )eA k \A^p 0 [b\...,b k }} 


25 


Each relation R 1 ^ is a first-order definable subset of \I{A)\ a \ 

rR' = {«&!,■■■, tf), ■■■, ■■■. o 6 m n v<k • • •. <]} 

Each constant symbol is a first-order definable element of \I(A)\, 
cj ( ' 4) = the unique (b 1 ,..b k ) G |/(M)| such that A h ..., 6 fc ]. (Ev¬ 
ery 'ipj should be such that there is exactly one tuple (b 1 ,..., b k ) satisfying 
</3o A Vh i otherwise / does not dehne a first-order query) 

A first-order query is either boolean, and thus defined by a first-order 
sentence, or is a fc-ary first-order query, for some k. 

We can now say precisely what we meant by saying that every structure 
may be thought of as a graph : PI 

Let T g denote the vocabulary of graphs. For any vocabulary cr there exist 
first-order queries / : STRUC [cr] —>■ STRUC [t 9 ] and I~ l : STRUC^] —> 
STRUC [cr] with the following property, 

for all A G STRUC [cr], I~\l{A)) ^ A 

Definition 1.5.3. Let / : STRUC[cr] —y STRUC[r] be a query. Let M 
be a Turing machine. Suppose that for all A G STRUC [cr], M(bin(A )) = 
bin(I(A)). Then we say that M computes I. 

We assume that the reader is familiar with the following classical com¬ 
plexity classes : L deterministic logspace, P deterministic polynomial 
time, and PSPACE deterministic polynomial space. 

Definition 1.5.4. (Q(C), the queries computable in C) 

Let / : STRUC [cr] —$■ STRUC [r] be a query, and C a complexity class. 
We say that I is computable in C (or in C for short) iff the boolean query 
lb is an element of C. where 

R = {(A,i,a) | The i-th bit of bin(I(A)) is “a”}. Let Q(C) be the set of 
all queries computable in C: Q(C ) = C U {/ | I b G C}. 

Definition 1.5.5. (Many-One Reduction) 

Let C be a complexity class, and let K C STRUC [a] and H C STRUC [r] 
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be boolean queries. Suppose that the query / : STRUC[cr] —> STRUC[r] 
is an element of Q(C ) with the property that for all A G STRUCfcr], 

A e 1(A) g H 

Then / is a C-many-one reduction from K to H. We say that K is 
C-many-one reducible to //, in symbols, K <c H. For example, when / 
is a first-order query, the reduction is called a first-order reduction (<f Q ); 
when / G Q( L), the reduction is called a logspace reduction (<i og ); and 
when / G Q(P), the reduction is called a polynomial-time reduction (< p ). 

Definition 1.5.6. Let K be a boolean query, let C be a complexity class, 
and let < r be a reducibility relation. We say that K is C- complete under 
< r iff : 

(1) K G C, and, 

(2) for all H eC, H < r K. 

Definition 1.5.7. (Closure under First-Order Reductions) 

A set of boolean queries S is closed under first order reductions iff when¬ 
ever there are boolean queries K and H such that H G S and K <f D H, 
we have that K G S. We say that a logic £ is closed under first-order 
reductions iff the set of boolean queries definable in £ is so closed. 

Definition 1.5.8. (Alternating Turing Machine) An alternating Turing 
machine is a Turing machine whose states are divided into two groups : 
the existential states and the universal states. The notion of when such 
a machine accepts an input is defined by induction : The alternating 
machine in a given configuration C accepts iff 

1. C is in a final accepting state, or 

2. C is in an existential state and there exists a next configuration C' 
that accepts, or 

3. C is in a universal state, there is at least one next configuration,and 
all next configurations accept. 
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Note that this is a generalization of the notion of acceptance for a 
nondeterministic Turing machine, which is an alternating Turing machine 
all of whose states are existential. 

Definition 1.5.9. (Boolean Circuits) 

A boolean circuit is a directed acyclic graph 

C = (V, E, G A , G v , G., /, r) G STRUC[t c ]; r c = {E, G A , G v , G., /, r} 
where 

The vertices v with no edges entering them, are called leaves, and the 
input relation J(n) represents the fact that the leaf v is on (i.e. recieved 1 
as input). The vertices that are not leaves are called internal vertices. An 
internal vertex w is said to be an "and"-gate if G A (w) holds, an "or"-gate 
if G v (u>) holds, and a "not"-gate if G~,(w ) holds. Any internal vertex 
should be in exactly one of G A ,G V , or G-,. A "not"-gate should have 
indegree 1. r is a vertex with no outgoing edges and is called the root. 

Definition 1.5.10. (Computation by Boolean Circuits) 

Let S C STRUC[r s ] be a boolean query on binary strings (where r s is the 
vocabulary of binary strings). In circuit complexity, S would be computed 
by an infinite sequence of circuits 

C = {Ci | i = 1,2,...}, 

where C n is a circuit with n input bits (i.e., vertices with no ingoing edges, 
or leaves) and a single output bit r (i.e. there is only one vertex with no 
outgoing edges which is the root). For w £ {0, l} n , let C n (w ) be the 
value at G n ’s output gate, when the bits of w are placed in its n input 
gates (this defines the values at the input gates), and where the values at 
the other gates are defined inductively as follows (until we come to the 
output gate at the end) : If the gate is an "and"-gate then its value is 
1 iff the values of all gates from which there are edges entering it are 1, 
if the gate is an "or"-gate then its value is 1 iff at least one of the gates 
from which there are edges entering it has value 1, and if the gate is a 


28 


"not"-gate then its value is 1 iff the value of the unique gate from which 
there is an edge entering it is 0. 

We say that C computes S iff for all n and for all w G {0, l} n , 

w G S <S=> C n (w ) = 1. 

Definition 1.5.11. The size of a boolean circuit is the number of vertices 
in it, and its depth is the length of a longest path from root to leaf. 

Definition 1.5.12. (Uniformity) 

Let C = {Ci | i = 1,2,...} be a sequence of circuits, and let / : 
STRUC[r s ] —» STRUC[t c ] be a query such that for all n > 1, /(0 n ) = C n . 
That is, on input a string of n zeros the query produces circuit n. If / is 
a first-order query, then C is said to be a first-order uniform sequence of 
circuits. Similarly, if / G L, then C is logspace uniform. If / G P, then C 
is polynomial-time uniform, and so on. 

From now on, when we mention uniformity without specifying the 
kind, we mean first-order uniformity. 
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2 


Depth 


This chapter is devoted to the notion of depth of inductive definitions over 
finite structures. Roughly speaking, depth, is the number of iterations (of 
the function F^ on 0) needed to come to the fixed-point (of ip). 


2.1 The Notion of Depth 

Definition 2.1.1. 

Let (p(R, Xi,... , Xk) be a formula, where R is a relation variable of arity 
k, let A be a structure of size n. 

(1) Define the depth of <p in A, in symbols | tp A \, to be the minimum r 
such that 

pvA _ pi p,A 
r r+l ~ r r 

in case the fixed-point of F exists. If the fixed-point does not 
exist we agree to set \tp A \ := oo. 

It can be easily shown that < 2 n , in case the fixed-point of 
F^’ A exists. 

(2) Define the depth of ip as a function of n equal to the maximum 
depth of ip in A (over all structures A of size n) : 

MM := max{\ip A \} 

\\A\\=n 

(3) Define the inflationary-depth of ip in A, in symbols I^Iifp, as the 
depth of (. RxVip) in A, and the inflationary-depth of ip (as a function 
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of n), in symbols |<^|ifp, as the depth of (Rx V ip) (as a function of 
n ). 

We may also think intuitively of the depth for the simultaneous fixed- 
point and for the nested fixed-point as the total number of iterations 
needed to come to the fixed-point. 

Least fixed-point logic can be fragmentized into smaller logics by im¬ 
posing restrictions on depth. Before we mention the definition from Im- 
merman’s "Descriptive Complexity", we must mention following proposi¬ 
tion : 

Proposition 2.1.2. JEj.JTDJ 

On ordered structures, every FO(IFP)-sentence is equivalent to an FO(IFP)- 
sentence in which IFP occurs at most once. The same applies to FO(PFP) 
and PFP, and also to FO(LFP) and LFP. 

Hence when we talk about a fixed-point formula we may allow our¬ 
selves to imagine it as one in which the fixed-point operator occurs only 
once in front of a first-order formula, (or more precisely, in front of a 
second-order formula without any occurrences of a fixed-point operator 
or a second-order quantifier, but we sometimes call such formulas first- 
order for easiness). 

Definition 2.1.3. Let IND[/(n)] be the sub-logic of FO(LFP) in which 
only fixed-points of first-order formulas ip for which \ip\ is 0(f(n)) are 
included. 

From a previous remark, the fixed-point of a positive formula in a finite 

structure A comes at a stage < ||^4|| fe , where k is the arity of the relation 

variable through which the fixed-point is computed, i.e., the depth of 

positive formulas is always polynomially bounded. Hence we may assume 

that f(n) is polynomially bounded. 

It is clear now that, FO(LFP) = (J IND[n fc ] 

k> 1 

Example 2.1.4. (A Well-Known Example) In graphs, the transitive clo¬ 
sure of the edge relation is the least fixed-point of the formula 

<p(R, x, y ) := Exy V 3 z(E(x, z ) A R(z, y)) 
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i. e. for any u 7 ^ v there is a path from u to v iff 


[LFP RjXj y(p(R,x,y)]uv holds. 

In any graph Q, for any k > 1, 

(F ¥ ’ ,e ) fe ( 0 ) = {(x,y) \ x j- y and there is a path from x to y of length 
< k}, and since the distance (the shortest length of a path) from a vertex 
to another vertex connected to it in Q is at most n — 1 if \\Q\\ = n, the 
fixed-point is obtained at most at k = n — 1 i.e. after n — 1 iterations 
of the function Fon 0. Since for every n there is actually a graph on 
n vertices in which there is a path of length n — 1, then ip(R,x,y) is of 
depth n — 1, and therefore connectivity is expressible in IND[n\. Actually 
it is even expressible in IND[\ogn\ as the path relation can be expressed 
as the fixed-point of the formula 

'ifiR, x, y) := E{x, y) V 3 z{R{x, z) A R{z, y)) 
which is of depth [log n] + 1 . 

We now present another view of depth, also from Immerman’s "De¬ 
scriptive Complexity", depending on some form of writing positive formu¬ 
las that was originally introduced by Moschovakis |T3] (Moschovakis and 
Immerman both studied inductive definitions but the work of Moschovakis 
is different from the work of Immerman in that Moschovakis focused 
mainly on infinite structures while Immerman studies finite ones). The 
following theorem shows this form : 

Theorem 2.1.5. Let (p(R, Xi ,..., xjf) be an R-positive first-order for¬ 
mula, where R is a relation variable of arity k. Then there are quantifiers 
Q 1 ,..., Q s , and quantifier-free first-order formulas Mi, ..., M s , M s+ \ in 
which R does not occur such that 


ip(R, xi...x k ) = (QiZx, Mf)... ( Q s z s , M s )(3xi ... 3x k , M s+1 )Rx 1 , ...,x k 

where for formulas if and M, (3 x,M)ijj means 3x(M A'tf), and (Vx, M)^ 
means Vx(M — > if). 
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Proof. This theorem is proved for structures with two distinguished ele¬ 
ments 0 and 1. 

The proof is by induction on the complexity of p. We assume that all 
negations have been pushed all the way inside. 

There are two base cases : 


(1) If p = Rv i... Vk, then, 


p = (3z 1 ... 3 z k , M 1 )(3x 1 ... 3x k , M 2 )Rx 1 ...x k 


where 


Mi = z± = Vi A ... A z k = v k , 


and 


M 2 = x x = Zi A ... A x k = z k 


(2) If p is quantifier-free and R does not occur in p, then, 


p = (Vz, ~^p){3x 1 ... 3x k , xi ^ xi)Rxi... x k 


In the inductive cases p = 3vif and p = Vu^, we simply put (3n,n = v) 
and (Vn, v = v) in front of the quantifier block for i/j. Now there remains 
the cases for A and V. 

Suppose that p = a A /3 and 

a = (QiVuNx)... (Qty t ,N t )( 3x 1 ... 3x k ,N t+1 )Rx l ...x k 

and 


/3 = (Pi^i, Mi)... (. P s z s , M s )(3a;i... 3x k , M s+1 )Rxi ...x k 

where the Q ’s and P’s are quantifiers. We may assume that the yf s 
and z' s are disjoint and, are both disjoint from the free variables of p. 
Let 

QBi := (Qw!, N[)... (Q t y t , N{) 
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and 

QB 2 := (P 1 z 1 ,M[)...(P s z sl M' s ) 

where N- : = V (v = 0); and M[ := M* V (v = 1). 

Let Mi,..., Uk be a new set of variables, and for any formula 0(aq ,..., Xk), 
let i/>(u/x) denote the formula 0 with variables u \,... ,Uk substituted for 
the free occurrences of x\,... , Xk, and define the quantifier-free formulas, 

9 (v — 1 A N t+ i(u/x)) V (v = 0 A M s+ i(u/x)) 


and 


p := (ui = x x A ... A u k = x k ) 


We show that 


ip = (Vm, (v = 0 V v = 1 ))QBiQB 2 (3u, 9)(3x, p)Rx i... Xk (*) 

Put -0 := QBiQB2(3u,9)(3x, p)Rx \... Xfc.The right hand side of (*) 
holds iff in both cases, v = 0 and v — 1, the formula 0 holds. When 
v — 1 , the formula 9 is equivalent to N t+1 (u/x) whose free variables are 
from {yi ,..., y t , u ±,..., Uk} U free(a). Also all the formulas M[ are true 
when v — 1, thus QB 2 is the same as P\Z\... P s z s . And since z ±,..., z s 
are disjoint from yi,..., y t , ui ,..., Uk , and free(a), then 

< 5 ^ 2 (3m, 0)(3x, p)Rx i... Xk is equivalent to 

(3m, N t+ i(u/x))(3x, p)Rxi ... Xk, which is equivalent to 
(3x, N t+ i)Rxi ...x k . 

N- is equivalent to N t for every 1 < i < t when v = 1, so QB\ is the same 
as (Qiyi, Ni)... ( Q t yt , A^j.Thus 0 is equivalent to 


(QiUi,Ni) ■ ■ ■ (Qtyt, N t )(3x, N t+ i)Rxi ...x k 


i.e. equivalent to a. It can be similarly seen that, in case v = 0, 0 is 
equivalent to 0. Thus the right hand side holds iff both a and 0 hold 
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i.e. iff ip holds. It can now be similarly seen that if ip = a V /3 then 
ip = (3n, (v = 0 V v = l))^- □ 

Let us write QB to denote the quantifier block 

{Qizi, Mi )... ( Q s z si M s )(3x i... 3x kl M s+1 ) 

Thus in particular, for any structure A, any r E N, any a E A k , 

a E (F^ A ) r (0) iff A^{[QB] r F)[a\ 

Here [ QB] r means QB literally repeated r times. It follows immedi¬ 
ately that if t(n) = |<^|(n) and A is any structure of size n then 

A t= ([LFP R , X1 ,..., X MV ^ (lQBf n) F)(y)M for all a E A k . 

This directly inspires the following (apparently) more general defini¬ 
tion : 

Definition 2.1.6. [ ID] FO[t(n)] is defined to be the class of properties 
definable by quantifier blocks iterated t(n) times. (This is the same as 
being iterated 0(t(n)) times since a quantifier block may be any con¬ 
stant size). More precisely, a class S of finite structures is a member of 
FO[£(n)] iff there exist quantifier-free first-order formulas M i: 0 < i < s, 
a quantifier block QB = (Qi^i, Mi)...(Q s x s , M s ), a tuple c of constants 
(if necessary), and a function f{n) = 0(t(n )) such that for any structure 

A, 

AeS A N ([QB] f ^M 0 )(c/x) 

The reason for the substitution of constants is that the quantifier block 
QB may contain some free variables that must be substituted for to build 
a sentence. 

Clearly, for all polynomially bounded t(n), 

IND [t(n)} C FO[t(n)] 
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2.2 The Basic Theorems 


In this section we will see how depth can be regarded as a complexity 
measure through theorems relating it to well-known complexity classes. 
For example, it turns out that inductive depth eaquals parallel-time. The 
theorems in this section are from Immerman’s "Descriptive Complexity" 
ra and therefore we must impose here the same proviso he imposed at 
the beginning of his book : 

Proviso 2.2.1. We assume that the numeric relations and constants : 
<, PLUS, TIMES, BIT, SUC, 0, 1, max are all present in all vocabu¬ 
laries and are interpreted as follows : 0, 1, max are interpreted as the 
minimum, second, and maximum elements under < (which is the usual 
total ordering inherited from N ); this means we assume that all structures 
contain at least two elements 0 and 1. PLUS(i,j , k) means that i + j = k, 
TIMES(i,j,k) means that i x j = k, BIT(i,j) means that bit j in the 
binary representation of i is 1, and SUC is the usual successor. 

We begin with the theorem showing that on ordered structures least 
fixed-point, logic corresponds to the famous complexity class P : 

Theorem 2.2.2. (Immerman and Vardi 1982 

FO(LFP) = P 


i.e. a class of ordered structures is definable in FO(LFP) if and only if it 
is accepted by a deterministic polynomial-time machine. 

For a proof we must first mention some definitions and facts. 

Definition 2.2.3. (Alternating Reachability REACH a ) 

Let T ag = {E, A, s, t} be the vocabulary of alternating graphs. An alter¬ 
nating graph Q = (G, E*, AP , s G , t s ) is a directed graph whose vertices are 
labeled universal or existential. A s C G is the set of universal vertices. 
Let Pffxy be the smallest relation on vertices of Q such that : 
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(1) Pgxx 

(2) If x is existential and P%zy holds for some edge (x, z ) then Pfixy 

(3) If x is universal, and there is at least one edge leaving x, and Pffzy 
holds for all edges (x, z), then Pfixy 


REACHq, := {Q I P Q a st) 

It is clear that Pj'st iff [LFP^^jsf where 

p := x = y V (Ax A 3zExz A Mz(Exz -A Rzy )) V (->Ax A 3 z(Exz A Rzy)) 

also it is easy to see that if (x,y) G (-F‘ / 3 ,s ) fc ( 0 ) then the distance between 
x and y in the underlying graph is k — 1, hence the depth of <p is n. Thus 
REACH a e IND[n]. 

Theorem 2.2.4. /T7]/ 

REACH a is complete for P under first-order reductions. 

Definition 2.2.5. Let r, a, I be as in Definit.ion ll.5.21 Then / also defines 
a dual map, which is called /, from FO(LFP)[r] to FO(LFP)[cr]. For any 
formula p € FO(LFP)[r], I(p) is the result of replacing all relation and 
constant symbols in p by the corresponding formulas from the definition 
of /, using a map // defined as follows : 

- Each variable is mapped to a fc-tuple of variables : 

f,( v):=v\...,v k 

- Input relations are replaced by their corresponding formulas : 

MRiV i • • • v ai ) := pfif! Oi),..., fi(v a .)) 

- Quantifiers are replaced by restricted quantifiers : 

fi(3v) := (3fi(v),p 0 (fi(v))) 
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- The equality relation and other numeric relations are replaced by 
their appropriate formulas. 

- Second-order variables have their arities multiplied by k] and second- 
order quantifiers must be restricted. 

- On boolean connectives, fj is the identity. 

- Constant q s replaced by a /c-tuple of special variables (i.e. variables 
not used in any other place of the formula being formed) : 

fi(ci ) := 4, ■■■,4 

and these variables must be quantified before they are used. Typi¬ 
cally, these quantifiers can be placed at the beginning of the formula. 

Now the mapping / is defined as follows, for 6 G FO(LFP)[r] : 

m := 

(34 ... 34, (p o A ipi)(z{, • • •, 4)) • • • ( 3 4 • • • 34, (Fo A V4(4, • • •, 4))(//4)) 

Proposition 2.2.6. [Tty Let t, a, and I be as in Definition ] 1.5 A Then 
for all formulas 6 G FO(LFP)[t] and all structures A G STRUC[a\, and 
all assignments a in A, 

A\=I(9)[a] iff 1(A) \=9[a'] 

Where a' is defined at a variable x iff (a(x 1 ),..., a(x k )) G 1(A) (Recall 
that fj(x ) — x 1 ,... ,x k ). In this case a'(x) = (a^a; 1 ),... ,a(x k )). 

Remark 2.2.7. Let a, r, I be as in Definition 11.5.21 let K be a class of 
u-structures and H a class of r-structures. Suppose that K is first-order 
reducible to H via /, and H is definable by the least-fixed point sentence 
<p. Then A G K iff 1(A) G H iff 1(A) h ip iff A t= 1(f)- Ttm s ? to 
prove that FO(LFP) is closed under first-order reductions, it suffices to 
prove that for any least-fixed point formula p from FO(LFP)[r], I(p) is 
in FO(LFP)[cr]. But this follows directly from the fact that FO(LFP) is 
closed under boolean operations and first-order quantification. 
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Remark 2.2.8. Note that if 0 := \LFP R )X)y g)(R 1 x, y))xy is of depth n m , 
then : 

1(0) is _ jyk (Vt(Si (fi 0 (t)) A(p 0 (x) /\ipo(V) Al((p))]xy, and 

is of depth n km , because for any structure A, and any formula <p, ip takes 
on 1(A) exactly the same number of iterations as I (<p) on A to come to its 
fixed point, while ||/(*4.)|| = ||w4.|| fc . For this reason we tend to think that 
IND[n m ] would not turn out to be closed under first-order reductions. 

Remark 2.2.9. The hierarchy IND[n] C IND[n 2 ] C IND[n 3 ] C ... col¬ 
lapses to the 2 -tli level if and only if IND[n*] is closed under first-order 
reductions. In that case P = IND[n*]. To see this, note that from 

OO 

Theorem ITT7] P = FO(LFP), and since FO(LFP) = JIND[n fc ], then 

k=1 

OO 

P = [JIND[n fc ]. So, if this hierarchy collapses to the 2 -tli level, then 

k=1 

P = IND[n*], and then IND[n l ] is closed under first-order reductions 
because P is so closed. REACH a e IND[n l ], for any % > 1, and it is 
P-complete under first-order reductions. Thus, if IND[rd] is closed under 
first-order reductions, then P = IND[n l ], i.e., the hierarchy collapses to 
the 2 -th level. 

We also need the following fact : 

Theorem 2.2.10. JJU/ Every first-order boolean query is computable in 
deterministic log space. 

Now we prove Theorem 12.2.21 

Proof. FO(LFP) C P : 

Let A be an input structure, let n = ||^4||, and let [LFP R }X1 ,...,x k ( p]y 
be a least fixed-point formula (where <p is first-order). This fixed 
point evaluated on A is (F VvA ) nk ($). We know from the previous 
theorem that for any S C A k , any (ai,...,dfc), whether A satis¬ 
fies <p(S, ai ,..., afc) can be determined in logspace. Let M 0 be the 
logspace Turing machine that computes the query tp(S, oq,..., af) 
(The vocabulary now contains an extra k -ary relation symbol S, 
and extra k constants). 
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Let M be the Turing machine that, starting with S — 0 (let it have 
a tape for coding S in n k bits), cycles through all possibilities for 
(cq,..., a*,), substitutes each of these for (aq,..., aq) and runs M 0 
to see whether A h (p(S , Gq,..., a*,), then writes 1 on the bit corre¬ 
sponding to (cq,..., a,k) in the tape of S if this holds. 

The machine M does this for n k times (the number of tuples). S is 
coded on n k bits of a tape, so rewriting a bit of it takes, in addition 
to the logspace work of Mo, a polynomial time (to reach the bit 
that will be rewritten). From this and since M 0 is run n k times (the 
number of stages to reach the fixed-point), and M 0 is logspace, and 
LCP, then M is a polynomial time machine. 

P C FO(LFP) : 

Since any query in P is first-order reducible to REACH a by Theorem 
12.2.41 and REACH a e FO(LFP), and FO(LFP) is closed under first- 
order reductions, then any query in P is in FO(LFP). 

□ 

There is an analogue of this theorem for polynomial space. The com¬ 
plexity class of polynomial space has several descriptive characterizations. 
One of these is in terms of depth of inductive definitions; it is the set of 
boolean queries expressible by first-order quantifier blocks iterated expo¬ 
nentially : 

Theorem 2.2.11. 


PS PACE = FO(PFP) = FO[2 n ° W ] 

The equality of PSPACE and FO(PFP) was shown by Vardi (1982) 
ca. and the fact that PSPACE = FO[2 n ° a) ] was shown by Immerman 
(1980) [B]. 

We now exhibit without proof theorems relating certain inductive depths 
(possibly with another restriction) with certain complexity classes (with 
a corresponding restriction). 
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We begin with the theorem showing that inductive depth equals parallel¬ 
time and equals circuit depth in circuit complexity, but first we should 
specify the definitions of these complexity measures : 

Definition 2.2.12. [TO] 

A concurrent random access machine (CRAM) consists of a large number 
of processors, all connected to a common, global memory. The processors 
are identical except that they each contain a unique processor number. 
At each step, any number of processors may read or write any word of 
global memory. If several processors try to write the same word at the 
same time, then the lowest numbered processor succeeds. 

This is the "priority write" model. The results in this section remain 
true if instead we use the “common write" model, in which the program 
guarantees that different values will never be written to the same location 
at the same time. 

Each processor has a finite set of registers, including the following, Pro¬ 
cessor : containing the number between 1 and p[n) of the processor (the 
number of processors in the machine is a function p{n ) of the size of the 
input); Address : containing an address of global memory; Contents : 
containing a word to be written or read from global memory; and Pro- 
gramCounter : containing the line number of the instruction to be exe¬ 
cuted next. The instructions of a CRAM consist of the following : 

READ : Read the word of global memory specihed by Address into 
Contents. 

WRITE : Write the Contents register into the global memory location 
specihed by Address. 

OP R a Rb : Perform OP on R a and Rb and leave the result in Rb . Here 
OP may be Add, Subtract, or, Shift (where Shift(a:, y) is the instruction 
that causes the word x to be shifted y bits to the right). 

MOVE R a R b : Move R a to Rb . 

BLT R L : Branch to line L if the content of R is less than zero. 

The above instructions each increment the ProgramCouter, with the 
exception of BLT which replaces it by L , when the content of R is less 
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than zero. 

We assume initially that the contents of the first \bin(A)\ words of global 
memory contain one bit each of the input string bin(A). We assume also 
that a section of global memory is specified as the output. One of the bits 
of the output may serve as a flag indicating that the output is available. 
Our measure of parallel-time complexity will be time on a CRAM. Define 
CRAM[f(n)] to be the set of boolean queries computable in parallel-time 
t{n ) on a CRAM that has at most polynomially many processors. 

Definition 2.2.13. (Circuit Complexity) 

Let t{n ) be a polynomially-bounded function and let S' be a class of finite 
structures (of some fixed vocabulary). S is in AC[f(n)] if there is a uniform 
class of circuits Ci, C 2 , ■ ■ ■ with the following properties : 

(1) For all structures A, A £ S iff C\\a\\ accepts A. 

(2) There is a function f(ri) = 0(t(n )) such that for every n, the depth 
of C n is f(n). 

(3) For every n, the gates of C n consist of unbounded fan-in "and" and 
"or" gates. 

Theorem 2.2.14. [ty.jTDJ 

For all polynomially-bounded parallel-time constructible t(n ) : 

IND[t(n)] = FO[t(n )] = CRAM[t(n)] = AC[t(n )] 

The number of variables in an inductive definition determines the 
number of processors needed in the corresponding CRAM computation. 
The intuitive idea is that using k log tv bit variables, we can name approx¬ 
imately n k different parts of the CRAM. Thus, very roughly, k variables 
correspond to n k processors. The coming theorem illustrates the relation¬ 
ship between the number of variables in an inductive def ini tion and the 
number of processors in the corresponding CRAM. 

Before the theorem we must mention some definitions : 
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Definition 2.2.15. p]I| CRAM-PROC[t(n),p(n)] is the complexity class 
CRAM[t(n)] restricted to machines using at most 0(p(n)) processors. 

Definition 2.2.16. [TO] IND-VAR[t(n), s] is the complexity class IND[£(n)] 
restricted to inductive dehnitions using at most s distinct variables. 

Theorem 2.2.17. JQj If the maximum size of a register word and t(n ) 
are both o(y/n) and k > 1 is a natural number, then 

CRAM-PROC[t(n),n k } C IND-VAR[t(n), 2k + 2] C CRAM-PROC[t(n), n 2k+2 } 

We conclude the section with a theorem illustrating the relationship 
between inductive depth and alternating complexity. (Recall that the 
definition of alternating Turing machines involved alternating between 
universal and existential states and that in FO[t(n)] when the quantifier 
block is iterated the quantifiers alternate). 

Definition 2.2.18. [TO] 

ASPACE-ALT[/(n), g(n)\ is the class of boolean queries accepted by 
alternating Turing machines simultaneously using space f(n) and making 
at most g[n) alternations between existential and universal states starting 
with existential. 

Theorem 2.2.19. (Ruzzo and Tompa 1984 PJj) 

For t(n ) > logn, 

ASPACE-ALTfogn, t{n)\ = AC[t(n)} = FO[t(n )] 

2.3 Examples 

In this section we give examples to show that, in general, there is no 
order comparison between the depth of a formula and its inflationary 
depth, and no order comparison between the depth of the iteration it 
takes to come to the simultaneous fixed-point of two formulas and the 
depth of the iteration it takes to come to their nested fixed-point, if the 
relation variables through which the iteration is made are not restricted 
to be positive. (Throughout the section S denotes the usual successor). 
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Example 2.3.1. Consider the formula 
ip(X,x) := 

(Vy-iXy A x = min) 

V 

(3y(Xy A \/z(Xz —> z = y)) A 
( 3y(Xy A Syx) V (Xmax A (Sxmax Vs = max)))) 

V 

(3y3^(X|/ A Xz A Vu(Xu -P-(u = y\/u = z))Ay<zA 

{(y 0 A (x = z V S'#?/)) V (?/ = 0 A (5xz V 3 m(S , xm A S’-uz)))))) 

V 

(XO A XI A Vy(Xy —>• (y — 0 V y = 1)) A (x — 0 V x — 1)) 

For any positive integer n and any structure A of size n, the sequence 
0, F l/, ’- 4 (0), (F^’- 4 ) 2 (0),... is as follows : 

MOM 1 },---,!™ - l},{n-2,n- l},{n-3,n- 1}, 

{n — 4, n — 1},..., {0, n — 1}, {n — 3, n — 2}, {n — 4, n — 2},..., 

{0, n - 2}, {n - 4, n - 3}, ..., {1, 2}, {0, 2}, {0,1}, {0,1}, {0,1},... 

The fixed-point is {0,1} and it comes after n+(n — l) + (n — 2) + .. .+2 + 1 
iterations, i.e., after iterations. While for the inflationary fixed- 

point, which is the fixed-point of the formula Xx V +(X, x), we have the 
sequence : 0, {0}, {0,1}, {0,1},.... So the fixed-point comes just after two 
iterations. Hence | + |(n) > \Lp\ IFP {n); \ip\(n) £ 0(n 2 ) while \<p\iFp(n) is 
constant. 

Example 2.3.2. Consider the formula 
X,x) := 

(fi/y^Xy A Vz(z < x —* Xz)) V (3 yXy A x — 1) 

V (3u3v(Xu A Xv A u f v) A Vz(z < x —» Xz)) 

For any positive integer n and any structure A of size n, the sequence 
0, F^’- 4 (0), (F^’- 4 ) 2 (0),... is as follows : 

That is the fixed-point of this formula comes just after two iterations, 


45 



while for its inflationary fixed-point we have the sequence : 

0, {0}, {0,1}, {0,1, 2}, {0,1, 2, 3},..., {0,1,..., n- 1}, {0,1,, n-1}, ... 

That is, its inflationary fixed-point comes after n iterations. Thus |^>|(n) = 
2 while \if\iFp{n) = n. 

The following example shows that the nested fixed-point of two formu¬ 
las can actually take more iterations than their simultaneous fixed-point 
when each one of them is positive in both A" and Y. 

Example 2.3.3. 

Consider the formulas 

(p(X, Y, x ) := x = 0 V 3z(Xz A Szx ) 

and 

if(X, Y, y) := y = 0 V 3 z(Yz A Szy). 

In any structure of size n the computation of their simultaneous fixed- 
point goes on as follows : 

(0, 0), ({0}, {0}), ({0,1}, {0,1}),..., ({0,1,..., n - 1}, {0,1,..., n - 1}), 
({0, 1,... ,n — 1}, {0,1,..., n — 1}),... 

i. e. it takes n iterations to compute the simultaneous fixed-point. 

While for the nested fixed-point [LFPx, x p(X, [LFPy^i X, Y, y)], x)] 7 First 
X is interpreted with 0 and n iterations take place to compute the fixed- 
point of if before we can evaluate the new interpretation of X which is {0}. 
In general, with every interpretation of X it takes n iterations to compute 
the fixed-point of if before we can evaluate the new interpretation of X; 
and it can be easily seen that the interpretations of X grow as follows 
0, {0}, {0,1},..., {0,1,..., n — 1}, {0,1,..., n — 1},.... Thus it takes a 
total of n(n + 1) iterations to come to the nested fixed-point. 

By intuition one expects the nested fixed-point of two formulas to 
take more iterations than their simultaneous fixed-point if the relation 
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variables through which the iteration is made are positive. 

Open Question : Are there formulas (p(X,Y,x) and ',Y,y) positive 
in both X and Y such that their simultaneous fixed-point takes “more" 
iterations than one of the two nested fixed-points ? 

If the occurrences of X or Y are not restricted to be positive, we can find 
such examples : 

Example 2.3.4. 

Let 

<p{X,Y,x) := 

s = 0 V (s = 1 A Mz(z 7 ^ max —>■ Yz)) V 3z{Xz A z ^ 0 A Szx ) 

V 

(XO AllA WzYz A x = x) 
and 

V>( X,Y,y) := 

y = 0 V 3 z(Y z A Szy Ay ^ max ) V (3 zY z A (y = y —» ~<3zXz)) 

V 

(XO A XI A -i 3 zY z A y — y) V (V^F z Ay = y) 

ip is positive in both X and Y, but is not positive in X nor in Y. 

In any structure A of size n the computation of their simultaneous fixed- 
point goes on as follows : 

( 0 , 0 ), ({ 0 }, { 0 }), ({ 0 }, { 0 , 1 }), ({ 0 }, { 0 , 1 , 2 }),..., ({ 0 }, { 0 , 1 ,..., n- 2 », 
({ 0 , 1 }, { 0 , 1 , • • •, n — 2 », ({ 0 , 1 , 2 }, { 0 , 1 ,..., n — 2 », ..., 

({ 0 , 1 , ..., n — 1 }, {0, 1 , ... ,n — 2 », ({ 0 , 1 ,..., n — 1 }, {0, 1 ,. .., n — 2 », ... 

i.e. it takes 2n — 2 iterations to come to the simultaneous fixed-point. 
While for the nested fixed-point [LFPx,x(p(X, [LFPY, y f>(X, Y, y)],x)], when 
first X is interpreted with 0, the computation of the fixed-point o/^(0, Y, y) 
goes on as follows : 0 , { 0 }, { 0 , 1 ,..., n— 1 }, { 0 , 1 ,..., n— 1 },..., that is, its 
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fixed-point comes in just two iterations. Now X has the new interpretation 
{a G A | A 1= + [0, {0,1,... ,n — l},a]} which is {0,1}. With this inter¬ 
pretation of X the computation of the fixed-point of -0({O,1}, Y, y) goes 
on as follows : 0, {0,1,..., n — 1}, {0,1,..., n — 1},..., that is, its fixed 
point comes in just one iteration. The new interpretation of X should be 
{a E A | A t= (^[{0,1}, {0,1,, n — 1}, a]} which is {0,1,..., n—1}. With 
this new interpretation of X the fixed-point of fi>{{ 0,1,..., n — 1}, Y, y) is 
{0,1,..., n — 1}; and {a E A \ A 1= ^[{0,1,..., n— 1}, {0,1,..., n— 1}, a]} 
is equal to {0,1,..., n—1}. Hence {0,1,..., n—1} is the nested fixed-point 
and it comes after a total o/(2+ 1 + 1 + 1), i.e., 5 iterations. 

Example 2.3.5. 

Let <p(X, Y, x ) := ifi/u^Xu A x = 0) V (3 uXu A x — 1) 
and 

if(X, Y, y ) := (3 uXu A 'iu~Yu At/ = «/)Vj/ = 0V 3 z(Yz A Szy) 
ip is not positive in X, and is not positive in Y. 

For any structure A of size n the computation of their simultaneous fixed- 
point goes on as follows : 

(0,0),({O},{O}),({1},{O,1}),({1},{O,1,2}),..., 

({!}> (0) 1) • • • > n — 1}), ({1}, (0,1,..., n — 1}),... 

i.e. it takes n iterations to come to the simultaneous fixed-point. 

While for the nested fixed-point [LFP Yty 'il’([LFPx,x l p(X, Y, x)], Y, y)\, when 
first Y is interpreted with 0, the computation of the fixed point of<p(X, 0, x) 
goes on as follows : 0, {0}, {1}, {1},..., that is, it takes just two iter¬ 
ations to come to its fixed-point. Now Y has the new interpretation 
{a G A | A N ^[{l}^, a]} which is {0,1, ...,n — 1}. With this inter¬ 
pretation of Y the fixed point of <p(X, (0,1,..., n — l},x) is {1}; and 
{a E A \ A ^[{1}, {0,1 ,..., n — 1}, a]} is equal to {0,1,,..., n — 1}. 
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Hence {0,1,..., n — 1} is the nested fixed point and it comes after a total 
of (2 + 1), i.e., 3 iterations. 

One more example to show that the nested fixed-point can take ac¬ 
tually more iterations than the simultaneous fixed-point; but here the 
occurrences of the relation variables are not restricted to be positive. 

Example 2.3.6. 

Let 

<p{X,Y,x) := 

{3uYu A 

3z(fi/u(Yu —» z < u) A ( yv(Vu(Yu —» v < u) —» v < z)) A (x = z V Sxz ))) 
V 

3z(Xz A Sxz) V x = 0 
and 

i>(X,Y,y) := 

3z(\/u(Xu u < z) A (' \/v(\/u(Xu —* u < v) —> z < v)) A y = z) V 
A Szy) V Yy. 

For any structure A of size n the computation of their simultaneous fixed- 
point goes on as follows : 

( 0 , 0 ),({O},{O}),({O},{O, 1 }),({O},{O, 1 , 2 }), 

({ 0 }, {0,1,... ,n — 1 }), ({ 0 }, { 0 ,1,..., n — 1 }),... 

i.e. it takes n iterations to come to the simultaneous fixed-point. While 
for the nested fixed-point [LFPx )X f{X-, [LFPY, y 'ip(X, Y, y)\, x)\, when first 
X is interpreted with 0, the computation of the fixed-point of i/j((t\,Y,y) 
goes on as follows : 

0 ) {0}) {0,1}, {0,1, 2},..., {0,1,..., n — 1}, {0,1,..., n — 1},... 
that is n iterations to come to the fixed-point. The new interpretation of 
X is {a G A | A b +[0, {0,1,..., n — 1}, a]} which is {0}. With this new 
interpretation of X, the computation of the fixed-point o/ , 0({O}, Y, y) goes 
on as follows : 
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0){1}j{1)2},{1,2, 3},...,{1,2, — 1},{1,2, ..., 77 , — 1},... 

that is n — 1 iterations to come to the fixed-point; and then the new in¬ 
terpretation of X is {a e A | A N <£>[{0}, {1, 2,..., n — l},a]} which is 
{0,1}. 

In general the interpretations of X grow as follows 
0 ) {0}) {0,1}, {0,1, 2},..., {0,1,..., n — 1}, {0,1,..., n — 1},.. 
and {0,1,..., n — 1} is the nested fixed point, while for every one of these 
interpretations before the fixed-point, say {0,1,... , m} with m < n — 1, 
the computation of the fixed-point of 0,1,..., m}, Y, y ) goes on as fol¬ 
lows 

0, {m + 1}, {m + 1, m + 2}, {m + 1, m + 2, m + 3},..., 

{m + l,m + 2,... ,n — 1}, {m + 1, m + 2 ,... ,n — 1},... 

?'.e. n — [m + 1) iterations to come to the fixed-point. Hence the total 
number of iterations to come to the nested fixed-point is 
n + l + (n-l) + l + (n-2) + l + ... + 2 + l + l + l i.e. ^±12 + n. 
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3 Finite Variable Logics 


We have seen in the previous chapter that parallel-time and inductive 
depth are equal and are equal to the depth of iteration of a quantifier block 
(with each iteration the quantifier rank of the resulting formula increases). 
We have also seen that the number of variables in an inductive definition 
is related in some way to the number of processors in the corresponding 
CRAM. Accordingly, it is important to study the expressive power of 
number of variables and quantifier rank. This takes us to Finite Variable 
Logics. 

3.1 Pebble Games 

In this section we introduce the basics of finite variable logics and their 
respective games (pebble games). This section is mainly from [2]. 

Definition 3.1.1. FO^ is the fragment of first-order logic of formulas 
that use at most k distinct variables (free and bound), and FO[fc,n] is 
the fragment of first-order logic of formulas of quantifier rank at most n 
that use at most k distinct variables (free and bound). 

Definition 3.1.2. (Pebble Games) 

Fix a vocabulary r and let * be a symbol that does not belong to the 
universe of any structure. For a 6 (A U {*}) s , a = a\... a s , let 
suppifa ) := {i\ at <E A } be the support of a, and if a G A, let af denote 
a\... di-id d l+ \ ... d s . For a G (A U {*}) s and b E (B U {*}) s we say that 
a t —b is an s-partial isomorphism from A to B if supp(a) = supp(b) and 
af t —> b is a partial isomorphism from A to B, where af and b are the 
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subsequences of a and b with indices in the support. 

Let A and B be structures, a G (AU{*}) S , b G (L> U{*}) s with supp(a ) = 
supp(b). In the pebble game G%fA, a, B,b ) we have s pebbles ay,..., a s 
for A and s pebbles fii ,..., (3 S for B. Initially, cq is placed on a* if a* G A, 
and off the board if a* = *, and similarly for B, /?* and bi. 

A play of this game consists of m moves. In its j'-th move, Spoiler selects 
a structure A or B , and a pebble for this structure (being off the board 
or already placed on an element). If it selects A and ay it places cq on 
some element of A. and then Duplicator places A on some element of B. 
If Spoiler selects B and /3*, it places /3, on an element of B and Duplicator 
places a.i on some element of A. (Note that there may be several pebbles 
on the same element). 

Duplicator wins the game if for each j < m we have that e G / is an 
s-partial isomorphism, where e = e±... e s are the elements marked by 
a±,... ,a s after the j'-tli move ( e* = * in case cq is off the board ) and 
where f — fi ■■■ f s are the corresponding values given by 0i,... ,/3 s . For 
j = 0 this means a i —y b is an s-partial isomorphism. The pebble game 
G s 00 {A,a,B^b) with infinitely many moves is defined similarly. We use 
G s m (A, B) as abbreviation for G s m (A, *...*, B, * ... *) and G^(^l, B) for 
G^iA,* .. .*,B,* .. .*). 

Example 3.1.3. Consider the orderings A := ({a, b}, <) and 
B := ({c, d, e}, <) where a < b and c < d < e. Spoiler has a winning 
strategy in G^(A,B). Spoiler first pebbles c in B and Duplicator has to 
pebble a in A with the corresponding pebble, because if it pebbles b in¬ 
stead, it will not be able to reply in the second move when Spoiler pebbles 
something greater than c with a second pebble. In the second move Spoiler 
pebbles e in B with a second pebble and Duplicator has to pebble b in A 
with the corresponding pebble. In the third move Spoiler pebbles d in B 
with the third pebble and Duplicator cannot reply because there is no ele¬ 
ment between a and b in A corresponding to the element d between c and 
e in B. 

Definition 3.1.4. 

Structures A and B are s-m-isomorphic, in symbols A = s w B , if there is 
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a sequence ( Ij)j<m °f nonempty sets of s-partial isomorphisms with the 
following properties : 

( s-forth property ) For j < m, a H y b E Ij+i, 1 < i < s, and a E A, 
there is b E B such that a“ i —> b% E 

( s-back property ) For j < m, a i—)■ 6 E I J+ \ , 1 < i < s, and b E B, 
there is a E A such that a“ i—^ 6^ G 

Z l J 

We then write (Ij)j< m '■ A — 0 £>. 

Definition 3.1.5. Structures *4. and /3 are s-partially isomorphic, in sym¬ 
bols .4 =p ari >£?, iff there is a nonempty set / of s-partial isomorphisms 
with the forth and back properties, respectively : 

For o 4 i G /, 1 < i < s, and a E A, there is b E B such that 
a^^bh E I. 

Z l 

For a 4 6 G /, 1 < i < s, and b E B, there is a E A such that 
a“ i-> p e /. 

z z 

We then write / : .4 =* art £>. 

Definition 3.1.6. For m E N, any structure A, and a E (A U {*} s ), the 
s-m- isomorphism type ip™ ( = s 0^0 ) °f ® i n A is given by : 

V4(^) := | ip atomic or negated atomic, and 4N^[a]}, 

:= /\ (/\3iw/>r, AVu, \/ «|)- 

l<z<s a£A a£A 

In particular, 0^ := 0™ * is an FO s -sentence of quantiher rank m. 

Definition 3.1.7. The sets 1400.4,13) and 4400.4, 13) of s-partial isomor¬ 
phisms corresponding to winning positions of Duplicator in the respective 
games are defined as follows : 

1400.4, B) := {a i->- b | Duplicator wins G s m {A, a, B,b)}, and 
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W^(A, B) {a ^ b \ Duplicator wins G^A, a, B, 6)} 

We write A = s m B to express that A and B satisfy the same F0 S - 
sentences of quantifier rank < m. 

The following theorem shows that the logics and the games fit to¬ 
gether. When we write A k </?[a] for a G {A U {*}) s we tacitly assume 
that the free variables of ip have indices in suppifa ) (that is, i G suppifa ) 
whenever Vi G free(ip)). 

Theorem 3.1.8. 

Let A and B be structures, and let a G (dU{*}) s andb G (i?U{*}) s with 
supp(a ) = suppfb). Then the following are equivalent : 

(i) Duplicator wins G s m (A, a, B,b). 

(it) a^be Wf(A, B) and (Wf(A, B))j< m : A = s m B. 

(Hi) There is ( Ij)j< m with a H y b g I m such that (. Ij)j< m ■ A = s m B. 

(iv) B k if^[b]. 

(v) a satisfies in A the same F0 s -formulas of quantifier rank < m as b 

in B. 

Corollary 3.1.9. 

(i) Duplicator wins G s m (A,B). 

(n) {W?{A,B))j<m-A^ s m B. 

(m) A = s m B. 

(iv) B k ififf. 

(V) ^ = S m B. 

From the previous theorem we deduce the following technique for prov¬ 
ing non-expressibility in FO [s,m] : 
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Remark 3.1.10. When we want to prove that some property is not 
expressible in FO [s,m] we may prove this by proving that there is a 
structure A satisfying the property and a structure B not satisfying the 
property such that Duplicator wins £>). We can also prove that 

there is a sentence in the logic FO [k,n\ not equivalent to any sentence in 
the logic FO[s, m] by proving that there are two structures A and B such 
that Spoiler wins G*(A,B) while Duplicator wins £>). 


3.2 Quantifier Rank and Number of Variables 

In this section we investigate the inclusion and strict inclusion relations 
among the logics FO [k,n\. 

Proposition 3.2.1. For every n > 0, 

FO[ 1, n] C FO[ 2, n] C FO[ 3, n] C ... 

Proof. The formula ip := /\ (yi ^ vf) belongs to FO [k + 1 ,n] for 

l<i<j<k-\-l 

every n but is not equivalent to a formula in FO[/c, n] for any n. A 
formula in FO[fc,n] can have at most k variables and thus, at most k free 
variables, and so for any satisfiable formula in FO [k,n] there is a first- 
order assignment satisfying it that gives the same value to V}. and ffc+i- 
This assignment, of course, does not satisfy <p. Thus is not equivalent 
to any satisfiable formula in FO [k,n]. Moreover, since (p is satisfiable, it 
is not equivalent to any contradictory formula. Hence <p is not equivalent 
to any formula in FO [k,n\. (Note that this also proves that for k' > k 
and n > n' , FO [k',n'} ^ FO [k,n\, and in general, FO 1 C FO 2 C ... ) □ 

Proposition 3.2.2. For every n > 0 and every k > 3, 

FO[k , n] C FO[k, n + 1] 

(This is a rephrasing of Proposition 6.15. in Immerman’s "Descriptive 
Complexity " W) 
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Proof. 

This proof is valid for both, structures with and without ordering. If 
one wants to consider the proof for unordered structures, then when we 
mention ordering in the proof one should understand it as the ordering 
of the names of the vertices, (but keep in mind that we are using that 
natural ordering of the names of vertices in the proof even when we are 
considering unordered structures). 

We will show that the property of directed graphs Q = (G,E g ,s,t) with 
two distinguished elements s,t saying that "the distance between s, t is 
2 n+1 " is expressible in F0[3,n + 1] but not expressible with a first-order 
formula of quantifier rank < n. And thus, we actually prove more than 
the statement of the proposition, in particular if k > k' > 3 and n' > n 
then FO [k',n'] ^ FO[/c,n]. 

Set ipo(x,y) := Exy, this is of quantifier rank 0 and it uses no vari¬ 
ables outside {x,y,z}. 

Assuming that the formulas ipi(v,w ) with v,w G {x,y,z} have been de¬ 
fined, and that they use variables only from {x,y,z\, and that they are 
of quantifier rank i, then 

iPi + i(x,y) := 3 z((fii(x,z) A <Pi(z,y)) is of quantifier rank i + 1 and uses 
variables only from the set {x,y,z}. 

The other ifi + i(v,w) with v,w G {x,y,z} can be defined similarly. 

It can be easily seen, by induction, that (p n+ i(s,t) expresses that the dis¬ 
tance between s,t is 2 n+1 . 

Now we show that Duplicator wins G n (A,B), where 
A = ({0 ,..., 2 n+1 + 1}, {(i, i + 1) | 0 < i < 2 n+1 }, 0, 2 n+1 + 1) and, 

B = ({0,..., 2 n+1 j, {(i, i + 1) | 0 < % < 2 n+1 - 1}, 0, 2 n+1 ). 

Let the dj-distance be defined as in Example 11.3.141 
For 0 < j < n, set 

Ij := {p G Part(^l, B ) | p preserves order and dj + \(a, a') = dj + i(p(a),p(a')) 

for all a, a 1 G do(p)} 

dj + i(s A ,t A ) = oo = dj + i(s B , t B ) for all 0 < j < n. So 
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{(s* 4 , s B ), (t A ,t B )} G Ij for all 0 < j < n. Hence Ij ^ 0 for all 0 < 
j < n. Suppose j < n, p G Ij+i, a G A, If there is a' G do(p) such that 
dj + i(a, a') < 2 J+1 , then there is exactly one b G B for which pU{(a, 0} 
is a partial isomorphism preserving order and dj+i-distances. 

If there is no such a', let do(p) = {cp, ..., a m } with cp < ... < a m . 
Then for some l<i<m — 1, at<a< a i+ 1 , and dj + i(a,i,a) = oo, 
dj + i(a,di + i) = oo, hence dj + 2 (ai, ai+i) = oo and therefore, 
dj + 2 (p(cp),p(cp+i)) = oo. Thus there is a b such that p(a,i) < b < p(a,i + 1 ), 
dj + i(p(a,i),b) = oo and dj + i(b } p(a i+ i)) = oo. It is easy to see that 
pU{(a, 0} is a partial isomorphism in Ij. 

(Note that here when j = 0 that 

"dj +2 (ai, a i+ i) = oo and dj + 2 (p(cp),p(a i+ i)) = oo" means that 
d(di, a i+ i), d(p(ai),p(di + i)) > 4 and so Duplicator can find a vertex be¬ 
tween p(ai),p(di+ 1 ) that is not adjacent to any one of them as a is not 
adjacent to any of a*, ep+i). 

The back property can be proven similarly. □ 

Proposition 3.2.3. 

There is a property of words expressible in FO[ 2, 2 n + 2] but not expressible 
in FO[k,n\ for any k. 

Before we give a proof we need first to mention some definitions and 
facts. Let E be a finite alphabet and let E* denote the set of finite words 
over it. 

Definition 3.2.4. (Boundary Position) [T2] 

A boundary position denotes the first or last occurrence of a letter in a 
given word. Boundary positions are of the form d a where d G {t>, <} 
and a G £. The interpretation of a boundary position d a on a word 
w = W\...w\ w \ G E* is defined as follows : 



If there are no occurrences of a in w then we set d a (w ) to be undefined. 
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A boundary position can also be specified with respect to a position q G 


{ min{i G [q + 1, |w|] | Wi = a} if d — > 

max{i G [1, q — 1] | Wi = a} if d — < 

Definition 3.2.5. (Rankers) p2j 

Let n be a positive integer. An n-ranker r is a sequence of n boundary 
positions. The interpretation of an n-ranker r — (p i, ...,p n ) on a word w 
is defined as follows : 


r(w) 


Pi(w) if r = (pi) 

< undefined if (pi,... ,p n -i)(w) is undefined 

Pn(w, (pi,... ,p n -i)(w)) otherwise 


Instead of writing n-rankers as a formal sequence (p\ ,... ,p n ), we often 
use the simpler notation p 1 . . ,p n . We denote the set of all n-rankers by 
R n , and the set of all /r-rankers that are defined over a word w by R n (w). 

Theorem 3.2.6. (Expressibility of the Defined-ness of a Ranker, Weis 
and Immerman 2007 

Let n be a positive integer, and let r G R n . There is a formula 
ip r G FO[ 2, n] such that for all w G E* ; w N ip r r G R n {w). 

Now we prove Proposition 13.2.31 : 

Proof. The property of "the defined-ness of the 2 n +2-ranker >i >i ... >i" 

s ->v- ' 

2 n +2 

is expressible in FO[2,2 n + 2] but not expressible in FO [k,n] for any k. 
From the theorem we have just mentioned of Immerman and Weis , this 
property is expressible in FO[2,2 n + 2]; but from the proof of Exam¬ 
ple 11.3.141 (of the non-expressibility of evenness of ordered structures in 
first-order logic), Duplicator has a winning strategy on the following two 
strings in the n-round game : 

Wi and W 2 strings of l’s of lengths 2 n + 2 and 2 n + 1 respectively (so the 
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ranker >1 >1 ... >1 is defined in w\ but not defined in wf)- 

v ^ 7 

V* 

2 n +2 

It is important to note that in Example 11.3.141 the ordering vocabulary 
contained max and min , but here the vocabulary of strings does not nec¬ 
essarily contain these constants. The presence of these constants made it 
possible, in the proof of the second case for the forth property in the exam¬ 
ple, to restrict ourselves to the case when a is between two elements a* and 
cij + 1 from the domain of the partial isomorphism. To recover this we may 
change the definition of Ij so that it assumes that the first element and 
the last element of A are in the domain of the partial isomorphism. □ 


3.3 Infinite Disjunctions and a Revisit to Depth 


In this section we present inhnitary logic (logic with infinite disjunctions). 
All the fixed-point extensions of first-order logic that we have exhibited 
are contained in inhnitary finite variable logic. Provoked by this, we 
suggest a rough relationship between depth and the number of disjunc¬ 
tions. Then we introduce a new complexity measure FO \j[f(n), g(n)\ 
which counts the number, fin), of V-symbols, and the number, g(n ), of 
variables, in first-order formulas needed to express a given property. We 
prove that for f(n) > logn, NSPACE[/(n)] C FO v [/(n) + , j&£], 

and that for any f(n),g{n ), FO\f[f[n), g{n)] C DSPACE[/(n)g(n) logn], 
and as a corollary we have : 


U FO vK + 

k> 1 




F0[2 n ° (1) ]. 


Definition 3.3.1. (The Inhnitary Logic Loo^) 

Let r be a vocabulary. The class of Loo^-formulas over r is given by the 
following clauses : 

- it contains all atomic hrst-order formulas over r 


- if (p is a formula then so is ->ip 

- if ip is a formula and x a variable then 3 xg> is a formula 


59 








- if T is a set of formulas (possibly infinite) then \f T is a formula. 

The semantics is a direct extension of the semantics of first-order logic 
with \f T being interpreted as the disjunction over all formulas in T; 
hence, neglecting the interpretation of the free variables, 

A b T iff for some E T, A h ijj. 

We set /\ T := —> VI - 'V 7 I V’ £ Then f\ T is interpreted as the 

conjunction over all formulas in T. By identifying (p V ip) with \/{p, 
we see that L^ is an extension of first-order logic. 

In finite model theory, when talking about an Loo^-formula we may 
assume that it contains only a countable number of disjunctions as the 
following proposition suggests : 

Proposition 3.3.2. fffj In the finite, every L^- formula p(x) is equiva¬ 
lent to an Loo^-formula ip(x) with only countably many disjunctions and 
whose free variables are from the free variable of ip. 

Proof. By induction on the rules for Lo^-formulas. The translation pro¬ 
cedure preserves the "structure" of formulas and only replaces inhnitary 
disjunctions by countable ones. In the main step suppose that 

<p(x) = \JWi{x) | i el} 

is an Loo^-formula. For each finite structure C with universe {1, 2,... , | \C \|} 
and each c E C, if there exists i E I such that C t= pi [c], choose such an 
%. Let Jo be the set of V s chosen in this way. Then I 0 is countable and 
\J{pi(x) | i E 1} and \J{pi(x) \ i E J 0 } are equivalent in the hnite. □ 

Definition 3.3.3. (Inhnitary Finite Variable Logics 
^ooui i s th e fragment of L^ of formulas that use at most s distinct vari¬ 
ables (free and bound); and L‘^ ow := (J L^. 

s> 1 

Theorem 3.3.4. Jjjfl Let A andB be structures, and let a E (LlU{*}) s and 
b E (FU{*}) S with suppla ) = supp(b). Then the following are equivalent: 
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ft) Duplicator wins G^ >0 (A, a, B,b). 

(n) a^beW^(A, B) and W^A, B) : A = s part B. 
(Hi) There is I with a H y b e I such that I : A = s part B. 
(iv) a satisfies in A the same -formulas asbinB. 
Corollary 3.3.5. fBj The following are equivalent : 

(i) Duplicator wins G^AjB). 

(a) W^(A, B) : A =‘ parl B. 

(Hi) "A =part fi¬ 
fty) A = L °°“ B. 

Theorem 3.3.6. jij/ 


FO(PFP) C 

For a proof we first prove the following lemma : 

Lemma 3.3.7. 

Let ip(X,x) be a first-order formula where all variables are among v\,.. ., Vi¬ 
and X. Suppose X is s-ary and x = X\ .. .x s (with x\,...,x s among 
Vi, ... ,Vk). Then for every n, there is a formula tp n (x) in FO k+s defining 
the stage F%. 

Proof. 

Let 2/1 = Vk+i,... ,y s = Vk+ S - Then <p n (x ) can be defined inductively by 

<P°(x) ■= -<xi = Xl, 

(i.e., we replace in ip(X,x) each occurrence of an atomic subformula of 
the form Xt by 3 y(y = t A 3x(x = y A (p n (x))); note that some variables 
of x may occur in t). □ 
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Now we prove FO(PFP) C L^. 

Proof. 

By Proposition 12.1.21 it suffices to show for first-order ip that [PFPx^jt 
is equivalent to a formula of L^. 

So suppose that 92 , k,s,X, x, y are as in the preceding Lemma. Then 
[PFPx,s</?]t is equivalent to the L^J-formula (we may assume without 
loss of generality that the variables in t are in {aq,..., x s }) 

V Wp" (z) ** P n+1 (x)) A <p n (ft)) 

n> 0 

where, to stay within L^J, we take 3 y(y — t A 3x(x = y A (p n (x))) for 
<p n (t). □ 

This, and the idea of the dependence of the depth (or the number of 
stages to compute the fixed point) on the size of the structure, provoke 
us to break this L^,-formula into a sequence of first-order formulas - 
with the same upper bound on the number of variables - in which the 
number of disjunctions is a function of the size of the structure (Recall 
the dependence of the number of iterations of the quantifier block on the 
size of the structure). So for every n > 1 , every structure A of size n : 

t(n) 

a t= [pfp x ,m<p]i ++ V (y^ty <z) ° y +1 (xo a ip i (t)) 

i=0 

where t{n) is the depth of ip(X,x). 

To make this more elaborate we define the following complexity classes: 

Definition 3.3.8. 

Let FOy[/(n)] denote the set of all classes of structures definable by a 
uniform sequence of FO fc -sentences in which the number of disjunctions 
is 0(f(n)), more precisely, a class S of structures is in FOy[/(n)] iff there 
is a sequence ipi, <^ 2 ,... of FO fc -sentences such that : 

( 1 ) For every positive integer n, every structure A of size n : 
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A e S iff A N (p n . 


(2) There is a constant number c such that, for every positive integer 
n, the number of V-symbols in tp n is at most cf(n ) (assuming that 
the formulas are expressed using V’s and —«’s only). 

(3) The map n i —> ip n is generable by a DSPACE[/(n)] Turing machine. 

The number of V-symbols in the formula ip°(x) is 0, and their number 
in </5 1 (T) is l + 2m where / is the number of V-symbols and m is the number 
of occurrences of X in <p(X,x) (Note that the formula 
3 y(y = t A3x(x = y Aip°(x))) with which the subformulas Xt are replaced 
contain exactly two A’s and no other binary connectives and hence, it 
contains exactly two V’s when it is written using only V’s and ->’s). It can 
be easily seen that the number of V-symbols in (p 2 (x) is / + m(2 + 1 + 2m) 
i.e. I + (l + 2)m + 2m 2 , and in cp 3 (x) is l + m(2 + / + (/ + 2 )m + 2m 2 ) 
i.e. I + (/ + 2)m + (/ + 2)m 2 + 2m 3 , and then by induction the number of 
V-symbols in (p l (x) is 

l + (Z + 2)m + (/ + 2)m 2 + ... + (/ + 2)m 1 " 1 + 2m* for any i > 4. 

Let h{i) denote the number of V-symbols in ip 1 (x). Then the number of 
V-symbols in the formula 

t(n) 

Vn ■■= V (Vx(cp % (x) fV ^ + 1 (T)) A <p*(t)) 

j=0 


t(n) 

is S (4 + 2h(i) + h(i + 1)) i.e. 

i=0 


(4 + / + (31 + 4)t(77.)) + (8 + l -f- (3/ + 6)(t(n) — l))m + 

(8 + l + (3/ + 6)(t(n) — 2))m 2 + (8 + l + (31 + 6)(t(n) — 3 ))m 3 + ...+ 

(8 + / + (3/ + 6))m t W“ 1 + (8 + /)m t ^ + 2 m*W +1 , 

Call this /(n). (Note that /(n) = @(2 c *^ n )) for some constant number c 
if m >2). It is not hard to see from the construction of the formulas ip n 
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that their generation does not need space more than 0(f(n)), thus, the 

class of structures definable by [PFPx^jt (under some interpretation of 

t) is in FOw +s [/(n)]. Hence FO(PFP) C \J FOt[ 2° (2 " J) ] (Recall that if 

j,k> i 

the relation variable through which the fixed-point is built is of arity j 
then it takes at most 2 nJ stages to come to the fixed-point if we do not 
restrict the formula to be positive in the relation variable, so the depth 
t(ri) of any formula is 0(2 nJ ) for some j). 

We think that the number of V-symbols is worth studying as a complexity 
measure. Also it is important to investigate the relationship between 
IND[t(ra)] (or FO[t(ra)]) and FOy (1) [2°W n »]. 

Likewise we can also define the complexity classes FOy[/(n), g{n)\ in 
which the number of variables depends on the size of the structure : 

Definition 3.3.9. Fix a vocabulary a. Assume that formulas are ex¬ 
pressed using V’s and —«’s only. We say that a class, S, of cr-structures is 
in FOy[/(n), g(n)\ if there exists a sequence of sentences {<p t | z = 1,2 ,...} 
from FO[cr], and constant numbers, k,l , such that : 

(1) For all cr-structures, A, if ||*4|| = n, then : 

A E S iff A 1= ip n - 

(2) (p n has < kf(n ) V-symbols and uses < lg(n ) distinct variables (free 
and bound). 

(3) The map n i —> ip n is generable by a DSPACE[/(n) + g(ri)\ Turing 
machine. 

Theorem 3.3.10. For f(n) > logn, (under Proviso \2.2.1\) . 

i V ^/(n)]CFO v [/ W+ (M) 2 ,M ] 

In particular, 

NL C FOy[logn, 0(1)} 
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Proof. (In this proof when we write an expression of functions, for exam¬ 
ple logn or we mean its ceiling). Let S C STRUC[u] be a boolean 
query in NSPACE[/(t 7)]. Let M be a nondeterministic f(n )~space ma¬ 
chine that accepts S, i.e., 

AeS iff M(bin(A )) |. 

Let m be such that M uses at most 777 /( 77 ) bits of work-tape for inputs 
of size n. Let cr = {_R 1; ..., R r , ci,..., q} where each Ri is of arity a* 
and let a = max{ai | 1 < % < r}. Let h(n) = m t(n) = , 

and g(n) = 4 + a + h(n ) + t(n). Let A be a cr-structure of size n. A 
configuration, or an instantaneous description, of M’s computation on A, 
can be coded as a g(n)-tuple of variables : 

(q, w i,..., w h{n) , s, ri,..., r a ,p, v t ,..., v t{n) ,v) 

The variable q encodes the state of the machine.The variables w±,, Wh( n ) 
encode the contents of M’s work-tape. Remember that each variable 
represents an element of *4.’s 77-element universe, so it corresponds to a 
log 77-bit number. The variable s determines at which relation, constant, 
or otherwise, the input-head is looking. The variables r 1; ... ,r 0 encode 
where in one of the input relations the input-head is looking, if it is look¬ 
ing at a relation. The variable p determines the number of the bit being 
read in one of the input constants if the input-head is looking at a con¬ 
stant. The variables V\, ... ,v t ( n ) encode the index of the variable where 
the work-head is looking. (Note that t{n ) variables together can name 
riA n ) , i.e. h(ji) , things). The variable v determines the number of the bit 
being read in a variable of wi ,..., %(„). 

We may assume without loss of generality that (0, 0,..., 0) is the unique 
start configuration of M and (max, max ,..., max) is its unique accept 
configuration. We denote the configuration (0,..., 0) by 0 and 
(max,..., max) by max. 

Thus the size of the configuration graph for an input structure of size n 
is at most n 9 ^ ("at most" because not all tuples necessarily represent 
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configurations). A G S if and only if M(bin(A )) i if and only if there 
is a path from 0 to max in the configuration graph of M’s computation 
on A, i.e., if and only if the configuration graph satisfies the sentence 
<p g (n) io g n[0, max] where the formulas pi(x,y) are defined inductively as 
follows : 

<Po{x,y) ■= Exy and, 

ifii + i(x,y) : = 3zVuVu(((u = xAv = z)\/(u = z/\v = y)) —$■ <pi(u,v)). 
It can be easily proved by induction that the number of V-symbols in (pi 
is Ai and that it expresses the existence of a path from x to y of length 
< 2*. Hence, in particular, </? s ( re )i ogn [0, max\ has Ag{n)\ogn (= 0(/(n)) 
as a function of n) V-symbols and expresses the existence of a path from 
0 to max of length < 2 9 ^ logn , i.e. n 9 ^ n \ in the configuration graph. 
Since the configuration graph is of size at most n 9 ^ , then p g ( n ) logn [0, max] 
expresses the existence of a path in general from 0 to max in it. It re¬ 
mains now to retrieve from <p 5 ( n )iogn[0, max] the corresponding sentence 
in the vocabulary of A and show that the number of V-symbols in it is 
0(f(n) + ) and the number of variables is 0(^^). (Note that 

any (p t contains only 5 variables x,y, z,u,v). We should show that this 
sentence depends only on the size of A but not on A itself. Then we 
should show that the sequence of sentences we have is generable by a 
deterministic f(n) + -space machine. 

This sentence is obtained from <p g ( n )iogn[0, max] by replacing the unique 
occurrence of an atomic subformula of the relation E with the formula 
w, s, r, p, v, v, q', w', s', r' , v' , v') expressing that there is an edge from 
(q, w, s, r , p, v, v ) to ( q w ', s', r', v', v') in the configuration graph, and re¬ 
placing each variable by a g(n)-tuple of variables. We do not need to re¬ 
strict quantifiers so that their variables represent configurations because 
the unique occurrence of ipE in the core of the sentence (note that E 
occurs only once in the core of pi) is already restricting the vertices on 
the path claimed by <p 9 (n)iogn[0, max] to represent configurations. This 
occurrence of ensure that the transition function of the machine can 
make it go in one step from configuration 0 to the situation represented 
by the next vertex on the path, and hence this vertex represents a con- 
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figuration, and so on up to the end of the path. 

There is an edge from (q,W,s,r,p,v,v) to (q',w', s',r',v',v') in the con¬ 
figuration graph of M’s computation on A if and only if M can go in 
one step from the configuration represented by (q,w,s,r,p,v,v) to the 
configuration represented by (</, w', s ', r', v', v'). 

i/je is a disjunction over M’s finite transition table. A typical entry in the 
transition table is of the form ((x,b,w),(x / ,id,w > ,Wd))- This says that 
in state x, looking at a bit that equals b with the input-head and a bit 
that equals w with the work-head, M may go to state x', move its input- 
head one step in direction id, write w' in the cell where it is looking in 
the work-tape and move its work-head one step in direction Wd- Let us 
describe the disjunct corresponding to ((x, b, w), (x 1 , id, w', Wd)) if b = 1, 
w — 0, w' — 1, id is right, and Wd is left. This disjunct is the conjunction 
of : 

(1) 'ipi : A formula saying that q equals x and q' equals x'. 

(2) 02 : A formula saying that the input-head was reading a 1 in some 
cell and is now looking at the next cell. 

(3) 0 3 : A formula saying that the work-head was reading 0 in some cell 
and now this cell contains 1 and the contents of all the other cells 
are unchanged, and the work-head is now looking at the previous 
cell. 

0! is the conjunction of the formulas BIT(g, i) for the bits i which are 
1 in x, and the formulas ->BIT(g, i) for the bits i which are 0 in x, and 
the formulas BIT(g',f) for the bits i which are 1 in x', and the formulas 
-iBIT(g',f) for the bits i which are 0 in x'. Since the number of states of 
the machine is finite, t say, then any state can be coded in log t bits, and 
hence this formula contains 21ogt conjunctions (logf for q and logt for 
q'). Hence this part of 0g contributes a constant number of V-symbols 
that is independent of the size of the structure. 

02 is a finite disjunction (independent of the size of the structure), since 
the number of relation symbols and constant symbols is finite, considering 
the different cases for the position of the input head: 
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(a) It was looking at some relation at a tuple before the last and is now 
looking at the next tuple of the same relation. This is determined 
by the values of s, s', r 1 ,..., r a , and r[,..., r' a . (If it was looking at 
relation Ri then that it was reading a 1 is expressed by R t r\ ... r ai ). 

(b) It was looking at the last tuple of a relation and is now looking at 
the first tuple of the next relation or at the first bit of c±. This is 
also determined by the values of s,s',r 1 ,..., r a , and r[,..., r' a , plus 
possibly p. 

(c) It was looking at a constant at a bit before the last and is now 
looking at the next bit of the same constant, or it was looking at 
the last bit of a constant and is now looking at the first bit of the 
next constant, or is now outside the range of bits of the input from 
the right. This is determined by the values of s,s',p, and p'. (If 
it was looking at the constant c* then that it was reading a 1 is 
expressed by BIT(q,p)). 

-03 is a disjunction depending on the size of the structure : 

h(n) 

\J (-iBIT(wj, v) A BIT(w;', v) A v' — v — 1 A ( Wj — w'j )A 
i=1 j;j¥=i 

t(n) 

Wx(x v —> (BIT(inj,x) -H- BIT(w;', x))) A (f\vj = u')) 

3 =1 

This is the disjunction for the case when the work-head is looking in a 
variable of W\,... ,Wh ( n ) at a bit other than the first bit of it, the other 
cases can be treated similarly. This contributes a number of ©((h(n)) 2 ) 
V-symbols to i/je- 

Noting that E occurs only once in any of the formulas ipi, then the number 
of V-symbols in the n-th sentence is Q(g(n) logn + (h(n)) 2 ), i.e., 

0(/(n) + )• The variables used in the formula are x, q, W \,..., 

w h ( n ),s,r 1 ,.. .,r a ,p, vi, .. • ,vt ( „),v,g',tyi, • • ■, w' h{n) , s', r[,... ,r' a ,p', 
v [,..., v' t ( n j,v', i.e., ©() variable, and the formulas ipi use only a hnite 
number of variables, hve, and E occurs only once in it, hence the number 
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of variables in the resulting n-th sentence is 

It can be easily seen that this n-th formula depends only on the size of A 
and not on A itself. Again from the construction of the formulas (pi and 
the fact that E occurs once in any of them, it can be easily seen that it 
takes @(/(n) + )-space to generate the n-th sentence. □ 

Theorem 3.3.11. For any functions f(n),g(n), funder Proviso \2.2.1\) . 
FO\j[f(n),g(n )] C DSPACE[f(n)g(n) logn] 

For a proof, we need first to prove the following lemma : 

Lemma 3.3.12. Every first-order formula with k V -symbols and using 
m variables is equivalent to a first-order formula in which the number of 
quantifiers plus the number of free variables is at most m(k + 1). 

Proof. By induction on k. When the number of V-symbols is zero, the 
formula is something of the form : 

QnX h ... Q^x^Rx i... x m , or Q^x^ ... Q^x^Rx i... x m . In each one of 
these formulas the number of quantifiers plus the number of free variables 
is m. 

For some k > 0, suppose the statement is true for FO ? "-fromulas with < k 
V-symbols, and consider an FO m -formula with k + 1 V-symbols. Such a 
formula is equivalent to a formula 6 of the form Q^x^ ... QyX^ (-> )(ip\/if), 
where (->) means that there may be or may be not a negation, and where 
(p is a formula with r V-symbols and r' free variables, and fit is a formula 
with s V-symbols and s' free variables, and r + s = k. By the induction 
hypothesis, the number of quantifiers plus the number of free variables 
in (p is < m(r + 1), and the number of quantifiers plus the number of 
free variables in if is < m(s + 1). Thus, the number of quantifiers in <p is 
< m{r + 1) — r' and the number of quantifiers in if is < m(s + 1) — s'. 
Hence the number of quantifiers in 6 is < l + m(r +1) — r' + m(s + 1) — s'. 
The number of free variables in 9 is < r' + s' «- l. Thus, the number of 
quantifiers plus the number of free variables in 6 is at most / + m(r + 1) — 
r' + m(s + 1) — s' + r' + s' — l, i.e., m{r + s + 2), i.e., m(k + 2). □ 
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Now we prove Theorem 13.3.111 


Proof, let S be in FOy[/(n), g(ri)\, then S is definable by a sequence of 
sentences ipi, ip 2 , ■ ■ ■ generable by a DSPACE[/(n) + g(n)\ machine such 
that <p n has 0(f(n)) V-symbols and uses 0(g(n)) variables. Hence, by the 
previous lemma, ip n has 0(f{n)g{n )) quantifiers, i.e. S is in QN[f(n)g(n)] 
(QN denotes Number of Quantifiers, cf. Immerman’s paper "Number of 
Quantifiers is Better than Number of Tape Cells" 0). The proof of 
the second inclusion of Theorem 2 in that paper implies that S is in 
DSPACE[/(n)g(n) logn]. □ 

As a direct corollary of these two theorems. 13.3. 10l and l3.3. 3~T1 we have, 
Corollary 3.3.13. 

( /c \ ^ 

—— ) , ] = F0\ 2 n ° (1) ] = PSPACE = FO(PFP) 

log n J log n 

The last two equalities were proved by Immerman and Vardi (Theorem 
12 . 2 . 111 ) . 

3.4 More on Pebble Games 

In this section we talk more about pebble games and depth. We ask 
a question about the depth of the formulas ip s ^ whose fixed-points in 
a structure A are the sets of winning positions (a,b) for Spoiler in the 
games (^(.A, a, A, b). We conclude the section with a conjecture relating 
the existence of an upper bound (a function) on the depths (in the struc¬ 
tures of a class of finite structures) of positive first-order formulas to the 
bounded-ness with the same bound, over the same class, of another char¬ 
acteristic of finite structures called the .s-rank, and relating the existence 
of both bounds to the expressibility of inhnitary finite variable formulas 
in IND[t(ra)], where t[n) is the claimed bound. 
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Obviously, for any structures A and B, 


Wq(A, B) D W*(A, B)D... 

and since there are at most (||*4.|| + l) s .(11i3|| + l) s s-partial isomorphisms 
from A to B, then there is an m < (||^4|| + 1) S .(||S|| + l) s such that 

w^(a,b) = w^ +1 (a,b). 

For A = B the minimum such m has a name : 

Definition 3.4.1. 

The minimum m such that W^(A,A) = W^+^A, A) is called the s-rank 
of A, and is denoted by r(s,*4.), or r(A) for short. 

Definition 3.4.2. 

Let K be a class of structures. We say that K is s-bounded if the set 
{r(*4) | A G K} of s-ranks of structures in K is bounded. The class K is 
bounded if it is s-bounded for every s > 1. 

Fix s, and let 

a ~ b iff a and b satisfy the same L^-formnlas in A 

We show that ~ is definable in FO(LFP). Let a and b range over A s . For 
j > 0 define on A s by induction : 

a ~o b iff a and b satisfy the same atomic formulas in A 

a 1 b iff a ~o b and for all i — 1,..., s and all a G A {b 6 A) 

there is b E A (a E A) such that a— b- 

i i 

Obviously, a b iff Duplicator has a winning strategy in the pebble 
game G s -{A, a, A, b) with s pebbles and j moves. Clearly, • • •, 

so that for some l (l is the s-rank r(^4)). For such an l we have 

~/=~. For the complements of we have 


^qC'S'j C . . . 
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They are the stages Ff s ’*, F % s '*,..where tp s ^(Z, x±, ..., x s , yi, ..., y s ) 
is the following formula positive in Z : 


V ++ -^J>(y)) V \J (3 XiWyiZxy V ^yfdxiZxy). 


4>e fo s 


1 <i<s 


if atomic 


Then the FO(LFP) formula z,x,^Ps,*{Z, x, y)]xy expresses x ~ y. 

Open Question: The fixed-point of the formula 
(p s ,*(x i, ..., x s , 2/1 ,..., y s , Z) is in IND[n 2s ]; is this fixed-point expressible 
in IND[n r ] for some r < 2s ? 

If not then we have the strict hierarchy 


IND [n 2 ] C IND[n 4 ] C IND[n 6 ] C ... 


Open Question : Is there a structure A for which r(s, A) is (| |.A| | + l) 2s ? 

Definition 3.4.3. [2] (Scott Formulas) 

For given a, the formula 




f>e(Au{*}) s 


(more exactly, <Tq = s (Ta,o) is called the s-Scott formula of a in A. It is an 
FOMormula of quantifier rank r(A) + 1 + s. In particular, a a '■= cr*_* is 
an FO s -sentence. 

These formulas capture the whole L^-theory of A : 

Theorem 3.4.4. J^j 

Let A be a structure. 

(a) For any structure B, 


B N a.4 iff A = L °°“ B. 


(b) For a € (A U {*}) s , any structure B and b e (B U {*}) s with 
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supp(a) = supp(b), 

B N a-a\b] iff a satisfies in A the same L^-formulas as b in B. 

Corollary 3.4.5. JM/ 

Each L^-formula p is equivalent to a countable disjunction of FG 3 - 
formulas. In fact, p is equivalent to the L^-formula 
\f{<Ja | A is a structure , a G A s , A N </?[a]}. Moreover, if K is any class 
of structures, then p and I A E K, a E A, A\= p\a]} are equivalent 

in all structures of K. 

As an application of the Scott formulas we present a condition for 
and FO s to coincide in expressive power, but first we need the following 
lemma : 

Lemma 3.4.6. For structures A and B, if WJ(A, A) = Wf +1 (A, A) 
and A = L B, then Wf(B. B) = Wf +1 (B, B). Hence : if A = L B then 
r(A) = r(B). 

Proof. Suppose that Wf(A,A) = Wf +1 (A, A) and A = L B. 

Put p := if J A A A Vui ... yvsifjl 

a&(Au{*}) 3 

Since Wf (A, A) = Wf +1 (A,A), then A h p, and since A = L °°“ B, then 
also B N p. B N implies : 

>*...*6 WJ(A, B) (1) 

and B t= A Vui ... Vu s (V4 t/4 +1 ) implies : 

a€(Au{*}) s 

w;(a,B) = w; + 1 (a,b) ( 2 ) 

From (1) and (2) it follows that 

Wf{A,B) : A = s part B (3) 

From (1) and (3) it follows that for any b G (B U {*}) s there is 
a G (A U {*}) s such that og6g Wf(A, B). 
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Let b I—)• b' G W- (13,13), then b and b' satisfy in B the same formulas of 
FO s of quantifier rank < j. There are a and a' such that a b and 
a’ i —y b' are in Wf(A,B). Then a and a 1 satisfy in A the same formulas 
of F0 S of quantifier rank < j. Thus a H > a 1 is in Wf{A,A) and hence 
in Wj +1 {A,A), so a and a' satisfy in A the same formulas of F0 S of 
quantifier rank < j + 1 . Since Wf (A. B) = Wf +1 {A,B) then b satishes 
in B the same FCF-formulas of quantiher rank < j + 1 as a in A, and b’ 
satishes in B the same FO s -formulas of quantiher rank < j + 1 as a' in 
A. Hence b and b' satisfy in B the same FO s -formulas of quantiher rank 


<j + 1. Thus b V e W- +1 (B, B). 


□ 


Theorem 3.4.7. JEj 

(a) For s > 1 the following are equivalent: 

(i) K is s-bounded. 

(ii) On K, every L^^-formula is equivalent to an F0 s -formula. 
(Hi) On K, every -formula is equivalent to an FO-formula. 

(b) K is bounded iff FO and have the same expressive power on 


K. 


Proof. 


As (b) is a consequence of (a), it suffices to prove (a). First suppose that 
K is s-bounded and set m := sup{r(s, A) \ A G K} < oo. Thus, for 
A G K and a in A, the quantiher rank of a-a is < m + s + 1. Let </? be 
any L^-formula. Then the disjunction in the preceding corollary is a 
disjunction of formulas of quantiher rank < m + s + 1 and hence, it is 
a finite one. This shows that (i) implies (ii). The implication from (ii) 
to (iii) is trivial. To show that (iii) implies (i) assume, by contradiction, 
that K is not s-bounded. Let Aq,A\,... be structures in K of pairwise 
distinct s-rank. For M C N let 
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By the previous lemma, if L, M C N and L ^ M then K Z (fL GG <pm■ 
Hence on K , contains uncountably many pairwise nonequivalent 

sentences and is therefore more expressive than FO. □ 

Theorem 3.4.8. /2j/ 

Let K be a class of structures. Let K be fixed-point bounded, i.e., for 
any first-order formula ip(X,x) positive in X with free variables among 
x, X, there is an mo such that 

K 1= \/x(tp mo+l (x) -G v mo (x)) 

Where (p m (x) is a first-order formula defining Ff). 

Then the following are equivalent : 

(i) K is fixed-point bounded. 

(ii) K is bounded. 

(Hi) On K, every Lf^^-formula is equivalent to an FO-formula. 

Proof. 

(i) =>- (ii) : 

Assume that K is fixed-point-bounded. Recall the formulas : 

<p s ^(Z,x,y) := \J (if(x) EE -1-0(17)) v \J ^xfiy^xy V 3yflx i Zxy) 

V>6 FO s l<i<s 

ijj atomic 

where x:=xi...x s , y := yi... y s . 

Since K is fixed-point bounded then there is mo such that 
K h VxVy(^™° +1 (x,y) <p^.(x,y)) 

this is equivalent to K N VlXy(-itp™^(x,y) —> (fx, y)). Thus for ev¬ 

ery A E K , every a,b E A s , if Duplicator wins G s mo (A, a, A, b ) then it wins 
G m 0 +i (A,a,A,b). Hence for every A E K, Wf o (A,A) = W^ o+l (A, A). 
So K is s-bounded. s was arbitrary, so K is s-bounded for every s, i.e., 
K is bounded. 
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(ii) <S=> (Hi) : was proved in Theorem 13.4.71 
(in) =>■ (i) : We prove (- 1 ( 2 ) =>• ->(iii)) 

Let ip(X,x) be a first-order formula positive in X with free variables 
among x, X. Assume that there is no m such that 

K b Vx(p> m+1 (x) -A tp m (x)) 

i.e. for every m, there is A m £ K such that 

A m ^3x(v m+1 (x)A^ m (x)) (*) 

For any M C N, set tpM ■— V (</? m+ 1 (T) A -^ip m (x)). If M ^ L, we 

mGM 

may assume without loss of generality that there is mo G M\L. From (*) 
A mo N 99 m ° + 1 (a) A -itp m °(a) for some a from A mo . Then A mo N <Pm(o), 
and Am 0 ^ V^Oh) f° r a U l > + 1, and A mo t= -up 1 (a) for all l < mo- 

For any l G L either l > mo or l < mo, thus A mo ¥ ip l+1 (a) A ~<(p l (a) for 
all l G L. Hence A mo ¥ <Pl- 

Thus for L ^ M , p>M and <pi are not equivalent on K. So on K there 
are uncountably many pairwise nonequivalent L^-sentences, hence the 
expressive power of on K is greater than that of FO. □ 

Inspired by this we introduce the following more general definition for 
fixed-point bounded-ness : 

Definition 3.4.9. 

For a class K of structures and polynomially bounded t(n), we say that K 
is t(n)-fixed-point bounded if for any first-order formula <p(X,x) positive 
in X with free variables among x, X, there is a constant number c such 
that for every positive integer n, and every structure A of size n in K, 
the depth of p in A is < ct(n). 

and then we suggest the following conjecture which is a generalization 
of the previous theorem : 
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Conjecture 3.4.10. For every class K of finite structures, every poly¬ 
nomially bounded t(n), the following are equivalent: 

(i) K is t{n) -fixed-point bounded. 

(ii) For every s, there is a constant number c such that for every positive 
integer n, every structure A of size n in K, 

r(s, *4.) < ct{n ). 

(hi) On K, every L^^-formula is equivalent to an IND[t{n)\-formula. 
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4 Conclusion and Future Re¬ 
search 


4.1 Conclusion 

Depth is an important complexity measure that has several characteriza¬ 
tions : 

(1) It equals the number of iterations of a first-order quantifier block 
when the inductive definition is positive, Theorem 12.2.141 (If the 
inductive definition is not positive, and hence its depth may be 
exponential, then the inductive definition is expressible by a first- 
order quantifier block iterated exponentially, Theorem 12.2.ill) . 

(2) The class of boolean queries definable by positive inductive defini¬ 
tions (which are of polynomial depth) is exactly the class of boolean 
queries accepted by a deterministic polynomial-time Turing ma¬ 
chine, Theorem 12.2.21 

(3) It equals parellel-time when the inductive definition is positive, The¬ 
orem 12.2.141 

(4) It equals circuit depth in Circuit Complexity, Theorem 12.2.141 

Also number of variables, quantifier rank, number of quantifiers, num¬ 
ber of V-symbols, and size of formulas in general, are all important com¬ 
plexity measures that are closely related to inductive definitions. We have 
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seen in Theorem 12.2.171 that the number of variables in an inductive defi¬ 
nition is related to the number of processors in the corresponding CRAM, 
and that quantifier rank and number of quantifiers increase with itera¬ 
tions in an inductive definition. In the discussion after Definition 13.3.81 
we have seen that the class of structures definable by the fixed-point of a 
first-order formula of depth t{n) is in F0y[2 < A n )] for some constant num¬ 
ber c. This shows roughly how many V-symbols an inductive definition 
of a certain depth needs. We have also seen that 

\ 2 

) , —-—] = F0[2 n ° (1) ] = PSPACE = FO(PFP) 
log n ) log n 

in Corollary 13.3.131 In his paper "Number of Quantifier is Better than 
Number of Tape cells "[7], Immerman showed that 

NSPACE[/(n)] C QN[ ({( g f ] C DSPACE[(/(n)) 2 ] 
and hence as a corollary we have 

QN[n° (1) ] = F0[2 n ° (1) ] = PSPACE = FO(PFP) 
this relates roughly number of quantifiers to depth. 


4.2 Open Problems 

We have left open some questions and a conjecture : 

(1) In Section 2.3, we asked whether it is possible for two formulas, 
positive in the relation variables through which the induction is 
made, that the computation of their simultaneous fixed-point takes 
more time or more iterations than the computation of one of their 
nested fixed-points. 

(2) In Section 3.4, we asked whether the fixed-point of the formula 
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<p Si *,(xi ,... ,x s ,yi,... ,y s , Z) can be expressed as the fixed-point of 
a formula of a smaller depth than n 2s . If not, then we have the 
strict hierarchy : 

IND[n 2 ] C IND [n 4 ] C IND[n 6 ] C .... 

(3) Directly after the previous question we asked the question : Is there 
a structure A for which r(s,*4.) is (||*4.|| + l) 2s ? 

The answer of this question should help answer the previous ques¬ 
tion. 

(4) Conjecture 13.4.101 which is a generalization of Theorem 13.4.81 

For every class K of finite structures, every polynomially bounded 
t(n), the following are equivalent: 

(i) K is t(n)-fixed-point bounded. 

(ii) For every s, there is a constant number c such that for every 
positive integer n, every structure A of size n in K, 

r(s, *4.) < ct(n). 

(iii) On K , every L^-formula is equivalent to an IND [t(n)]-formula. 
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